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I.     THE  LAW  OF  CAUSALITY. 

§  1.  We  start  with  the  assumption  that  everything  that  exists,  and  everything 
that  happens,  exists  or  happens  as  a  necessary  consequence  of  a  previous  state  of  things. 
If  a  state  of  things  is  repeated  in  every  detail,  it  must  lead  to  exactly  the  same  consequences. 
Any  difference  hetween  the  results  of  causes  that  are  in  part  the  same,  must  be  explainable 
by  some  difference  in  the  other  part  of  the  causes. 

This  assumption,  which  may  be  called  the  law  of  causality,  cannot  be  proved,  but 
must  be  believed;  in  the  same  way  as  we  believe  the  fundamental  assumptions  of  religion, 
with  which  it  is  closely  and  intimately  connected.  The  law  of  causality  forces  itself  upon 
our  belief.  It  may  be  denied  in  theory,  but  not  in  practice.  Any  person  who  denies  it, 
will,  if  he  is  watchful  enough,  catch  himself  constantly  asking  himself,  if  no  one  else,  why 
this  has  happened,  and  not  that.  But  in  that  very  question  he  bears  witness  to  the  law 
of  causality.  If  we  are  consistently  to  deny  the  law  of  causality,  we  must  repudiate  all 
observation,  and  particularly  all  prediction  based  on  past  experience,  as  useless  and  misleading. 

If  we  could  imagine  for  an  instant  that  the  same  complete  combination  of  causes 
could  have  a  definite  number  of  different  consequences,  however  small  that  number  might 
be,  and  that  among  these  the  occurrence  of  the  actual  consequence  was,  in  the  old  sense 
of  the  word,  accidental,  no  observation  would  ever  be  of  any  particular  value.  Scientific 
observations  cannot  be  reconciled  with  polytheism.  So  long  as  the  idea  prevailed  that  the 
result  of  a  journey  depended  on  whether  the  power  of  Njord  or  that  of  Skade  was  the 
stronger,  or  that  victory  or  defeat  in  battle  depended  on  whether  Jove  had,  or  had  not, 
listened  to  Juno's  complaints,  so  long  were  even  scientists  obliged  to  consider  it  below  their 
dignity  to  consult  observations. 

But  if  the  law  of  causality  is  acknowledged  to  be  an  assumption  which  always 
holds  good,  then  every  observation  gives  us  a  revelation  which,  when  correctly  appraised 
and  compared  with  others,  teaches  us  the  laws  by  which  God  rules  the  world. 

We  can  judge  of  the  far-reaching  consequences  it  would  have,  if  there  were  con- 
ditions in  which  the  law  of  causality  was  not  valid  at  all,  by  considering  the  cases  in 
which  the  effects  of  the  law  are  more  or  less  veiled. 


In  inanimate  nature  the  relation  of  cause  and  effect  is  so  clear  that  the  effects  are 
determined  by  observable  causes  belonging  to  the  condition  immediately  preceding,  so  that 
the  problem,  within  this  domain,  may  be  solved  by  a  tabular  arrangement  of  the  several 
observed  results  according  to  the  causing  circumstances,  and  the  transformation  of  the 
tables  into  laws  by  means  of  interpolation.  When,  however,  living  beings  are  the  object 
of  our  observations,  the  case  immediately  becomes  more  complicated. 

It  is  the  prerogative  of  living  beings  to  hide  and  covertly  to  transmit  the  influ- 
ences received,  and  we  must  therefore  within  this  domain  look  for  the  influencing  causes 
throughout  the  whole  of  the  past  history.  A  difference  in  the  construction  of  a  single 
cell  may  be  the  only  indication  present  at  the  moment  of  the  observation  that  the  cell  is 
a  transmitter  of  the  still  operative  cause,  which  may  date  from  thousands  of  years  back. 
In  consequence  of  this  the  naturalist,  the  physiologist,  the  physician,  can  only  quite  ex- 
ceptionally attain  the  same  simple,  definite,  and  complete  accordance  between  the  observed 
causes  and  their  effects,  as  can  be  attained  by  the  physicist  and  the  astronomer  within 
their  domains. 

Within  the  living  world,  communities,  particularly  human  ones,  form  a  domain 
where  the  conditions  of  the  observations  are  even  more  complex  and  difficult.  Living 
beings  hide,  but  the  community  deceives.  For  though  it  is  not  in  the  power  of  the  com- 
munity either  to  change  one  tittle  of  any  really  divine  law,  or  to  break  the  bond  between 
cause  and  effect,  yet  every  community  lays  down  its  own  laws  also.  Every  community 
tries  to  give  its  law  fixity,  and  to  make  it  operate  as  a  cause;  for  instance,  by  passing  it 
off  as  divine  or  by  threats  of  punishment;  but  nevertheless  the  laws  of  the  community 
are  constantly  broken  and  changed. 

Statistical  Science  which,  in  the  case  of  communities,  represents  observations,  has 
therefore  a  very  difficult  task;  although  the  observations  are  so  numerous,  we  are  able  from 
them  alone  to  answer  only  a  very  few  questions  in  cases  where  the  intellectual  weapons  of 
historical  and  speculative  criticism  cannot  assist  in  the  work,  by  independently  bringing  to 
light  the  truths  which  the  communities  want  to  conceal,  and  on  the  other  hand  by  re- 
moving the  wrong  opinions  which  these  believe  in  and  propagate. 

§  2.  An  isolated  sensation  teaches  us  nothing,  for  it  does  not  amount  to  an  ob- 
servation. Observation  is  a  putting  together  of  several  results  of  sensation  which  are  or 
are  supposed  to  be  connected  with  each  other  according  to  the  law  of  causality,  so  that 
some  represent  causes  and  others  their  effects. 

By  virtue  of  the  law  of  causality  we  must  believe  that,  in  all  observations,  we  get 
essentially  correct  and  true  revelations;  the  difficulty  is,  to  ask  searchingly  enough  and  to 
understand  the  answer  correctly.  In  order  that  an  observation  may  be  free  from  every 
other  assumption  or  hypothesis  than  the  law  of  causality,  it  must  include  a  perfect 
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of  all  the  circumstances  in  the  world,  at  least  at  the  instant  pn-< -eding  that  at 
\\liich  the  phenomenon  is  observed.  Mut  it  is  clear  thai  this  far  surpasses  what  can  be  done, 
even  in  the  most  important  cases.  Real  observations  have  a  much  simpler  form.  By  giving 
a  short  statement  of  the  time  and  place  of  observation,  we  refer  to  what  is  known  of  the 
state  of  things  at  the  instant;  and,  of  the  inlinite  multiplicity  of  circumstances  connected 
with  the  observation  we,  generally,  not  only  disregard  everything  which  may  be  supposed  to 
have,  little  or  no  influence,  but  we  pay  attention  only  to  a  small  selection  of  circumstances, 
\shieh  we  call  essential,  because  we  expect,  in  virtue  of  a  special  hypothesis  concerning 
the  relation  of  cause  and  effect,  that  the  observed  phenomenon  will  be  effect  of  these 
circumstances  only. 

Nay,  we  are  often  compelled  to  disregard  certain  circumstances  as  unessential. 
though  there  is  no  doubt  as  to  their  influencing  the  phenomenon;  and  we  do  this  either 
because  we  cannot  get  a  sufficient  amount  of  trustworthy  information  regarding  them,  or 
because  it  would  be  impracticable  to  trace  out  their  connection  with  the  effect.  For 
instance  in  statistical  observations  on  mortality,  where  the  age  at  the  time  of  death  can 
be  regarded  as  the  observed  phenomenon,  we  generally  mention  the  sex  as  an  essential 
circumstance,  and  often  give  a  general  statement  as  to  residence  in  town  or  country,  or  as 
to  occupation.  But  there  are  other  things  as  to  which  we  do  not  get  sufficient  information: 
whether  the  dead  person  has  lived  in  straitened  or  in  comfortable  circumstances,  whether 
he  has  been  more  or  less  exposed  to  infectious  disease,  etc. ;  and  we  must  put  up  with  this, 
even  if  it  is  certain  that  one  or  other  of  these  things  was  the  principal  cause  of  death. 
And  analogous  cases  are  frequently  met  with  both  in  scientific  observations  and  in  everyday 
occurrences. 

In  order  to  obtain  a  perfect  observation  it  is  necessary,  moreover,  that  our  sensations 
should  give  us  accurate  information  regarding  both  the  phenomenon  and  the  attendant 
circumstances;  but  all  our  senses  may  be  said  to  give  us  merely  approximate  descriptions 
of  any  phenomenon  rather  than  to  measure  it  accurately.  Even  the  finest  of  our  senses 
recognizes  no  difference  which  falls  short  of  a  certain  finite  magnitude.  This  lack  of 
accuracy  is,  moreover,  often  greatly  increased  by  the  use  of  arbitrary  round  numbers 
for  the  sake  of  convenience.  The  man  who  has  to  measure  a  race-course,  may  take  into 
account  the  odd  metres,  but  certainly  not  the  millimetres,  not  to  mention  the  microns. 

§  3.  Owing  to  all  this,  every  actual  observation  is  affected  with  errors.  Even  our 
best  observations  are  based  upon  hypothesis,  and  often  even  on  an  hypothesis  that  is  cer- 
tainly wrong,  namely,  that  only  the  circumstances  which  are  regarded  as  essential,  influence 
the  phenomenon;  and  a  regard  for  practicability,  expense,  and  convenience  makes  us  give 
approximate  estimates  instead  of  the  sharpest  possible  determinations. 

Now  and  then  the  observations  are  affected  also  by  gross  errors  which,   although 


not  introduced  into  them  on  purpose,  are  yet  caused  by  such  carelessness  or  neglect  that 
they  could  have  been,  and  ought  to  have  been,  avoided.  1  contradistinction  to  these  we 
often  call  the  more  or  less  unavoidable  errors  accidental.  For  accident  (or  chance)  is  not, 
what  the  word  originally  meant,  and  what  still  often  lingers  in  our  ordinary  acceptation 
of  it,  a  capricious  power  which  suffers  events  to  happen  without  any  cause,  but  only  a 
name  for  the  unknown  element,  involved  in  some  relation  of  cause  and  effect,  which  pre- 
vents us  from  fully  comprehending  the  connection  between  them.  When  we  say  that  it 
is  accidental,  whether  a  die  turns  up  "six"  or  "three",  we  only  mean  that  the  circumstances 
connected  with  the  throwing,  the  fall,  and  the  rolling  of  the  die  are  so  manifold  that  no 
man,  not  even  the  cleverest  juggler  and  arithmetician  united  in  the  same  person,  can  suc- 
ceed in  controlling  or  calculating  them. 

In  many  observations  we  reject  as  unessential  many  circumstances  about  which  we 
really  know  more  or  less.  We  may  be  justified  in  this;  but  if  such  a  circumstance  is  of 
sufficient  importance  as  a  cause,  and  we  arrange  the  observations  with  special  regard  to 
it,  we  may  sometimes  observe  that  the  errors  of  the  observations  show  a  regularity  which 
is  not  found  in  "accidental"  errors.  The  same  may  be  the  case  if,  in  computations  dealing 
with  the  results  of  observations,  we  make  a  wrong  supposition  as  to  the  operation  of  some 
circumstance.  Such  errors  are  generally  called  systematic. 

§  4.  It  will  be  found  that  every  applied  science,  which  is  well  developed,  may  be 
divided  into  two  parts,  a  theoretical  (speculative  or  mathematical)  part  and  an  empirical 
(observational)  one.  Both  are  absolutely  necessary,  and  the  growth  of  a  science  depends 
very  much  on  their  influencing  one  another  and  advancing  simultaneously.  No  lasting 
divergence  or  subordination  of  one  to  the  other  can  be  allowed. 

The  theoretical  part  of  the  science  deals  with  what  we  suppose  to  be  accurate 
determinations,  and  the  object  of  its  reasonings  is  the  development  of  the  form,  connection, 
and  consequences  of  the  hypotheses.  But  it  must  change  its  hypotheses  as  soon  as  it  is 
clear  that  they  are  at  variance  with  experience  and  observation. 

The  empirical  side  of  the  science  procures  and  arranges  the  observations,  compares 
them  with  the  theoretical  propositions,  and  is  entitled  by  means  of  them  to  reject,  if 
necessary,  the  hypotheses  of  the  theory.  By  induction  it  can  deduce  laws  from  the  obser- 
vations. But  it  must  not  forget  --  though  it  may  have  a  natural  inclination  to  do  so  - 
that,  as  shown  above,  it  is  itself  founded  on  hypotheses.  The  very  form  of  the  observation, 
and  especially  the  selection  of  the  circumstances  which  are  to  be  considered  as  essential 
and  taken  into  account  in  making  the  several  observations,  must  not  be  determined  by  rule 
of  thumb,  or  arbitrarily,  but  must  always  be  guided  by  theory. 

Subject  to  this  it  must  as  a  rule  be  considered  best,  that  the  two  sides  of  the 
science  should  work  somewhat  independently  of  one  another,  each  in  its  own  particular 


way.  In  what  follows  the  empirical  side  will  he  treated  exclusively,  and  it  will  be  treated 
mi  a  general  plan,  investigating  not  Mir  particular  way  in  which  statistical,  chemical,  phy- 
sical, and  astronomical  observations  are  made,  but  the  common  rules  according  to  which 
they  arc  all  submitted  to  computation. 


IT.     LAWS  OF  ERRORS. 

jj  5.  Every  observation  is  supposed  to  contain  information,  partly  as  to  the 
phenomenon  in  which  we  are  particularly  interested,  partly  as  to  all  the  circumstances, 
connected  with  it,  which  are  regarded  as  essential.  In  comparing  several  observations,  it 
makes  a  very  great  difference,  whether  such  essential  circumstances  have  remained  unchanged, 
or  whether  one  or  several  of  them  have  changed  between  one  observation  and  another. 
The  treatment  of  the  former  case,  that  of  repetitions,  is  far  simpler  than  that  of  the  latter, 
and  is  therefore  more  particularly  the  subject -of  our  investigations;  nevertheless,  we  must 
try  to  master  also  the  more  difficult  general  case  in  its  simplest  forms,  which  force  them- 
selves upon  us  in  most  of  the  empirical  sciences. 

By  repetitions  then  we  understand  those  observations,  in  which  all  the  essential 
circumstances  remain  unchanged,  in  which  therefore  the  results  or  phenomena  should  agree, 
if  all  the  operative  causes  had  been  included  among  our  essential  circumstances.  Further- 
more, we  can  without  hesitation  treat  as  repetitions  those  observations,  in  which  we  assume 
that  no  essential  circumstance  has  changed,  but  do  not  know  for  certain  that  there  has 
been  no  such  change.  Strictly  speaking,  this  would  furnish  an  example  of  observations 
with  systematic  errors;  but  provided  there  has  been  no  change  in  the  care  with  which  the 
essential  circumstances  have  been  determined  or  checked,  it  is  permissible  to  employ  the 
simpler  treatment  applicable  to  the  case  of  repetitions.  This  would  not  however  be  per- 
missible, if,  for  instance,  the  observer  during  the  repetitions  has  perceived  any  uncertainty 
in  the  records  of  a  circumstance,  and  therefore  paid  greater  attention  to  the  following 
repetitions. 

§  6.  The  special  features  of  the  observations,  and  in  particular  their  degree  of 
accuracy,  depend  on  causes  which  have  been  left  out  as  unessential  circumstances,  or  on 
some  overlooked  uncertainty  in  the  statement  of  the  essential  circumstances.  Consequently 
no  speculation  can  indicate  to  us  the  accuracy  and  particularities  of  observations.  -These 
must  be  estimated  by  comparison  of  the  observations  with  each  other,  but  only  in  the 
case  of  repetitions  can  this  estimate  be  undertaken  directly  and  without  some  preliminary 
work.  The  phrase  law  of  errors  is  used  as  a  general  name  for  any  mathematical  expres- 
sion representing  the  distribution  of  the  varying  results  of  repetitions. 
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Laws  of  actual  errors  are  such  as  correspond  to  repetitions  actually  carried  out. 
But  observations  yet  unmade  may  also  be  erroneous,  and  where  we  have  to  speak  hypo- 
thetically  about  observations,  or  have  to  do  with  the  prediction  of  results  of  future  repe- 
titions, we  are  generally  obliged  to  employ  the  idea  of  "laws  of  errors".  In  order  to  pre- 
vent any  misunderstanding  we  then  call  this  idea  "laws  of  presumptive  errors".  The  two 
kinds  of  laws  of  errors  cannot  generally  be  quite  the  same  thing.  Every  variation  in  the 
number  of  repetitions  must  entail  some  variations  in  the  corresponding  law  of  errors;  and 
if  we  compare  two  laws  of  actual  errors  obtained  from  repetitions  of  the  same  kind  in 
equal  number,  we  almost  always  observe  great  differences  in  every  detail.  In  passing  from 
actual  repetitions  to  future  repetitions,  such  differences  at  least  are  to  be  expected.  More- 
over, whilst  any  collection  of  observations,  which  can  at  all  be  regarded  as  repetitions,  will 
on  examination  give  us  its  law  of  actual  errors,  it  is  not  every  series  of  repetitions  that 
can  be  used  for  predictions  as  to  future  observations.  If,  for  instance,  in  repeated  measure- 
ments of  an  angle,  the  results  of  our  first  measurements  all  fell  within  the  first  quadrant, 
while  the  following  repetitions  still  more  frequently,  and  at  last  exclusively,  fell  within  the 
second  quadrant,  and  even  commenced  to  pass  into  the  third,  it  would  evidently  be  wrong 
to  predict  that  the  future  repetitions  would  repeat  the  law  of  actual  errors  for  the  totality 
of  these  observations.  In  similar  cases  the  observations  must  be  rejected  as  bad  or  mis- 
conceived, and  no  law  of  presumptive  errors  can  be  directly  based  upon  them. 

§  7.  Suppose,  however,  that  on  comparing  repetitions  of  some  observation  we  have 
several  times  determined  the  law  of  actual  errors  in  precisely  the  same  way,  employing  at 
first  small  numbers  of  repetitions,  then  larger  and  still  larger  numbers  for  each  law.  If 
then,  on  comparing  these  laws  of  actual  errors  with  one  another,  we  remark  that  they  be- 
come more  alike  in  proportion  as  the  numbers  of  repetitions  grow  greater,  and  that  the 
agreements  extend  successively  to  all  those  details  of  the  law  which  are  not  by  necessity 
bound  to  vary  with  the  number  of  repetitions,  then  we  cannot  have  any  hesitation  in  using 
the  law  of  actual  errors,  deduced  from  the  largest  possible  number  of  repetitions,  for  pre- 
dictions concerning  future  observations,  made  under  essentially  the  same  circumstances. 

This,  however,  is  wholly  legitimate  only,  when  it  is  to  be  expected  that,  if  we  could 
obtain  repetitions  in  indefinitely  increasing  numbers,  the  law  of  errors  would  then  approach 
a.  single  definite  form,  namely  the  law  of  presumptive  errors  itself,  and  would  not  oscillate 
between  several  forms,  or  become  altogether  or  partly  indeterminate.  (Note  the  analogy 
with  the  difference  between  converging  and  oscillating  infinite  series).  We  must  therefore 
distinguish  between  good  and  bad  observations,  and  only  the  good  ones,  that  is  those  which 
satisfy  the  above  mentioned  condition,  the  law  of  large  numbers,  yield  laws  of  presumptive 
errors  and  afford  a  basis  for  prediction. 

As  we  cannot  repeat  a  thing  indefinitely  often,  we  can  never  be  quite  certain  that 


;i  given  method  of  observation  may  l»n  culled  good.  Nevertheless,  we  shall  always  rely  on 
la\\s  of  actual  errors,  deduced  from  very  large  numbers  of  concordant  repetitions,  as  suffi- 
ciently accurate  approximations  to  the  law  of  presumptive  errors. 

And,  moreover,  the  purely  hypothetical  assumption  of  the  existence  of  a  law  of 
presumptive  errors  may  yield  some  special  criteria  for  the  right  behaviour  of  the  laws  of 
actual  errors,  corresponding  to  the  increasing  number  of  the  repetitions,  and  establish  the 
conditions  necessary  to  justify  their  use  for  purposes  of  prediction. 

We  must  here  notice  that,  when  a  series  of  repetitions  by  such  a  test  proves  bad 
and  inapplicable,  we  shall  nevertheless  often  be  able,  sometimes  by  a  theoretical  criticism 
of  the  method,  and  sometimes  by  watching  the  peculiarities  in  the  irregularities  of  the  laws 
of  errors,  to  find  out  the  reason  why  the  given  method  of  observation  is  not  as  good  as 
others,  and  to  change  it  so  that  the  checks  will  at  least  show  that  it  has  been  improved. 
In  the  case  mentioned  in  the  preceding  paragraph,  for  instance,  the  remedy  is  obvious.  The 
time  of  observation  is  there  to  be  reckoned  among  the  essential  circumstances. 

And  if  we  do  not  attain  our  object,  but  should  fail  in  many  attempts  at  throwing 
light  upon  some  phenomenon  by  means  of  good  observations,  it  may  be  said  even  at  this 
stage,  before  we  have  been  made  acquainted  with  the  various  means  that  may  be  employed, 
and  the  various  forms  taken  by  the  laws  of  errors,  that  absolute  abandonment  of  the  law 
of  large  numbers,  as  quite  inapplicable  to  any  given  refractory  phenomenon,  will  generally 
be  out  of  the  question.  After  repeated  failures  we  may  for  a  time  give  up  the  whole 
matter  in  despair;  but  even  the  most  thorough  sceptic  may  catch  himself  speculating  on 
what  may  be  the  cause  of  his  failure,  and,  in  doing  so,  he  must  acknowledge  that  the 
error  is  never  to  be  looked  for  in  the  objective  nature  of  the  conditions,  but  in  an  insuffi- 
cient development  of  the  methods  employed.  From  this  point  of  view  then  the  law  of 
large  numbers  has  the  character  of  a  belief.  There  is  in  all  external  conditions  such  a 
harmony  with  human  thought  that  we,  sooner  or  later,  by  the  use  of  due  sagacity,  parti- 
cularly with  regard  to  the  essential  subordinate  circumstances  of  the  case,  will  be  able  to 
give  the  observations  such  a  form  that  the  laws  of  actual  errors,  with  respect  to  repetitions 
in  increasing  numbers,  will  show  an  approach  towards  a  definite  form,  which  may  be  con- 
sidered valid  as  the  law  of  presumptive  errors  and  used  for  predictions. 

§  8.     Four  different  means  of  representing  the  law  of  errors  must  be  described,  and 
their  respective  merits  considered,  namely: 
Tabular  arrangements, 
Curves  of  Errors, 
Functional  Laws  of  Errors, 
Symmetric  Functions  of  the  Repetitions. 

In   comparing  these  means  of  representing  the  laws  of  errors,   we   must   take  into 


consideration  which  of  them  is  the  easiest  to  employ,  and  neither  this  nor  the  description 
of  the  forms  of  the  laws  of  errors  demands  any  higher  qualification  than  an  elementary 
knowledge  of  mathematics.  But  we  must  take  into  account  also,  how  far  the  different  forms 
are  calculated  to  emphasise  the  important  features  of  the  laws  of  errors,  i.  e.  those  which 
may  be  transferred  from  the  laws  of  actual  errors  to  the  laws  of  presumptive  errors.  On 
this  single  point,  certainly,  a  more  thorough  knowledge  of  mathematics  would  be  desirable 
than  that  which  may  be  expected  from  the  majority  of  those  students  who  are  obliged  to 
occupy  themselves  with  observations.  As  the  definition  of  the  law  of  presumptive  errors 
presupposes  the  determination  of  limiting  values*,  to  infinitely  numerous  approximations, 
some  propositions  from  the  differential  calculus  would,  strictly  speaking,  be  necessary. 


III.     TABULAR  ARRANGEMENTS. 

§  9.  In  stating  the  results  of  all  the  several  repetitions  we  give  the  law  of  errors 
in  its  simplest  form.  Identical  results  will  of  course  be  noted  by  stating  the  number  of 
the  observations  which  give  them. 

The  table  of  errors,  when  arranged,  will  state  all  the  various  results  and  the  fre- 
quency of  each  of  them. 

The  table  of  errors  is  certainly  improved,  when  we  include  in  it  the  relative  fre- 
quencies of  the  several  results,  that  is,  the  ratio  which  each  absolute  frequency  bears  to  the 
total  number  of  repetitions.  It  must  be  the  relative  frequencies  which,  according  to  the 
law  of  large  numbers,  are,  as  the  number  of  observations  is  increased,  to  approach  the 
constant  values  of  the  law  of  presumptive  errors.  Long  usage  gives  us  a  special  word  to 
denote  this  transition  in  our  ideas:  probability  is  the  relative  frequency  in  a  law  of  pre- 
sumptive errors,  the  proportion  of  the  number  of  coincident  results  to  the  total  number, 
on  the  supposition  of  infinitely  numerous  repetitions.  There  can  be  no  objection  to  con- 
sidering the  relative  frequency  of  the  law  of  actual  errors  as  an  approximation  to  the 
corresponding  probability  of  the  law  of  presumptive  errors,  and  the  doubt  whether  the 
relative  frequency  itself  is  the  best  approximation  that  can  be  got  from  the  results  of  the 
given  repetitions,  is  rather  of  theoretical  than  practical  interest.  Compare  §  73. 

It  makes  some  difference  in  several  other  respects  -  -  as  well  as  in  the  one  just 
mentioned  --if  the  phenomenon  is  such  that  the  results  of  the  repetitions  show  qualitative 
differences  or  only  differences  of  magnitude. 

§  10.  In  the  former  case,  in  which  no  transition  occurs,  but  where  there  are  such 
abrupt  differences  that  none  of  the  results  are  more  closely  connected  with  one  another  than 
with  the  rest,  the  tabular  form  will  be  the  only  possible  one,  in  which  the  law  of  errors  can 
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be  given.  This  case  frequently  occurs  in  statistics  and  in  games  of  chance,  and  for  this 
reason  the  theory  of  probabilities,  which  is  the  form  of  the  theory  of  observations  in  which 
these  cases  are  particularly  taken  into  consideration,  demands  special  attention.  All  pre- 
vious authors  have  begun  with  it,  and  made  it  the  basis  of  the  other  parts  of  the  science 
of  observation.  I  am  of  opinion,  however,  that  it  is  both  safer  and  easier  to  keep  it  to 
the  last. 

>M1.  If,  however,  there  is  such  a  difference  between  the  results  of  repetitions, 
tlnil,  there  is  either  a  continuous  transition  between  them,  or  that  some  results  are  nearer 
each  other  than  all  the  rest,  there  will  be  ample  opportunity  to  apply  mathematical  methods; 
mid  when  the  tabular  form  is  retained,  we  must  take  care  to  bring  together  the  results 
that  are  near  one  another.  A  table  of  the  results  of  firing  at  a  target  may  for  instance 
have  the  following  form : 


1  foot  too  high  

1  foot  to  the  left 
3 

Central 
17 

1  foot  to  the  right 
6 

Total 
26 

Central     

13 

109 

19 

141 

1  foot  too  low  

4 

8 

1 

13 

Total  .. 

20 

134 

26 

180 

If  here  the  heading  "1  foot  to  the  left"  means  that  the  shot  has  swerved  to  the 
left  between  half  a  foot  and  one  foot  and  a  half,  this  will  remind  us  that  we  cannot  give 
the  exact  measures  in  such  tables,  but  are  obliged  to  give  them  in  round  numbers.  The 
number  of  results  then  will  not  correspond  to  such  as  were  exactly  the  same,  but  dis- 
regarding small  differences,  we  gather  into  each  column  those  that  approach  nearest  to  one 
another,  and  which  all  fall  within  arbitrarily  chosen  limits. 

In  the  simple  case,  where  the  result  of  the  observation  can  be  expressed  by  a 
single  real  number,  the  arranged  table  not  only  takes  the  extremely  simple  form  of  a  table 
of  functions  with  a  single  argument,  but,  as  we  shall  see  in  the  following  chapters,  leads 
us  to  the  representation  of  the  law  of  errors  by  means  of  curves  of  errors  and  functional 
laws  of  errors. 

It  is  an  obvious  course  to  fix  the  attention  on  the  two  extreme  results  in  the  table, 
and  not  seldom  these  alone  are  given,  instead  of  a  law  of  error,  as  a  sort  of  index  of  the 
exactness  of  the  whole  series  of  repetitions,  and  as  the  higher  and  lower  limits  of  the 
observed  phenomenon.  This  index  of  exactness,  however,  must  be  rejected  as  itself  too 
inexact  for  the  purpose,  for  the  oftener  the  observations  are  repeated,  the  farther  we  must 
expect  the  extremes  to  move  from  one  another;  and  thus  the  most  valuable  series  of 
observations  will  appear  to  possess  the  greatest  range  of  discrepancy. 
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On  the  other  hand,  if,  in  a  table  arranged  according  to  the  magnitude  of  the 
values,  we  select  a  single  middle  value,  preceded  and  followed  by  nearly  equal  numbers  of 
values,  we  shall  get  a  quantity  which  is  very  well  fitted  to  represent  the  whole  series  of 
repetitions. 

If,  while  we  are  thus  counting  the  results  arranged  according  to  'their  magnitude, 
we  also  take  note  of  those  two  values  with  which  we  respectively  (a)  leave  the  first  sixth 
part  of  the  total  number,  and  (b)  enter  upon  the  last  sixth  part  (more  exactly  we  ought  to 
say  16  per  ct.),  we  may  consider  these  two  as  indicating  the  limits  between  great  and  small 
deviations.  If  we  state  these  two  values  along  with  the  middle  one  above  referred  to,  we 
give  a  serviceable  expression  for  the  law  of  errors,  in  a  way  which  is  very  convenient,  and 
although  rough,  is  not  to  be  despised.  Why  we  ought  to  select  just  the  middle  value  and 
the  two  sixth-part  values  for  this  purpose,  will  appear  from  the  following  chapters. 


IV.     CURVES  OF  ERROES. 

§  12.  Curves  of  actual  errors  of  repeated  observations,  each  of  which  we  must  be 
able  to  express  by  one  real  number,  are  generally  constructed  as  follows.  On  a  straight 
line  as  the  axis  of  abscissae,  we  mark  oft'  points  corresponding  to  the  observed  numerical 
quantities,  and  at  each  of  these  points  we  draw  an  ordinate,  proportional  to  the  number 
of  the  repetitions  which  gave  the  result  indicated  by  the  abscissa.  We  then  with  a  free 
hand  draw  the  curve  of  errors  through  the  ends  of  the  ordinates,  making  it  as  smooth 
and  regular  as  possible.  For  quantities  and  their  corresponding  abscissae  which,  from  the 
nature  of  the  case,  might  have  appeared,  but  do  not  really  appear,  among  the  repetitions, 
the  ordinate  will  be  =  0,  or  the  point  of  the  curve  falls  on  the  axis  of  abscissae.  Where 
this  case  occurs  very  frequently,  the  form  of  the  curves  of  errors  becomes  very  tortuous, 
almost  discontinuous.  If  the  observation  is  essentially  bound  to  discontinuous  numbers,  for 
instance  to  integers,  this  cannot  be  helped. 

§  13.  If  the  observation  is  either  of  necessity  or  arbitrarily,  in  spite  of  some  in- 
evitable loss  of  accuracy,  made  in  round  numbers,  so  that  it  gives  a  lower  and  a  higher 
limit  for  each  observation,  a  somewhat  different  construction  of  the  curve  of  errors  ought 
to  be  applied,  viz.  such  a  one,  that  the  area  included  between  the  curve  of  error,  the  axis 
of  abscissae,  and  the  ordinates  of  the  limits,  is  proportional  to  the  frequency  of  repetitions 
within  these  limits.  But  in  this  way  the  curve  of  errors  may  depend  very  much  on  the 
degree  of  accuracy  involved  in  the  use  of  round  numbers.  This  construction  of  areas 
can  be  made  by  laying  down  rectangles  between  the  bounding  ordinates,  or  still  better, 
trapezoids  with  their  free  sides  approximately  parallel  to  the  tangents  of  the  curve.  If  the 
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limiting  round  numbers  are  equidistant,  the  mean  heights  of  the  trapezoids  or  rectangles 
;H.  directly  proportional  to  the  frequencies  of  repetition.  In  this  rase  a  preliminary  con- 
struction of  curve-points  can  be  made  as  in  $  llf,  and  may  often  be  used  as  sufficient. 

It,  is  a  very  common  custom,  but  one  not  to  be  recommended,  to  draw  a  broken 
line  between  the  observed  points  instead  of  a  curve. 

$14.  There  can  be  no  doubt  that  the  curve  of  errors,  as  a  form  for  the  law  of 
errors,  has  the  advantage  of  perspicuity,  and  were  not  the  said  uncertainty  in  so  many 
cases  a  critical  drawback,  this  would  perhaps  be  sufficient.  Moreover,  it  is  in  practice 
quite  possible,  and  not  very  difficult,  to  pass  from  the  curve  of  actual  errors  to  one  which 
may  hold  good  for  presumptive  errors;  though,  certainly,  this  transition  cannot  be  founded 
upon  any  positive  theory,  but  depends  on  skill,  which  may  be  acquired  by  working  at  good 
examples,  but  must  be  practised  judiciously. 

According  to  the  law  of  large  numbers  we  must  expect  that,  when  we  draw  curves 
of  actual  errors  according  to  relative  frequency,  for  a  numerous  series  of  repetitions,  first 
based  upon  small  numbers,  afterwards  redrawn  every  time  as  we  get  more  and  more  repe- 
titions, the  curves,  which  at  first  constantly  changed  their  forms  and  were  plentifully 
furnished  with  peaks  and  valleys,  will  gradually  become  more  like  each  other,  as  also 
simpler  and  more  smooth,  so  that  at  last,  when  we  have  a  very  large  but  finite  number 
of  observations,  we  cannot  distinguish  the  successive  figures  we  have  drawn  from  one  an- 
other. We  may  thus  directly  construct  curves  of  errors,  which  may  be  approved  as  pictures 
of  curves  of  presumptive  errors,  but  in  order  to  do  so  millions  of  repetitions,  rather  than 
thousands,  are  certainly  required. 

If  from  curves  of  actual  errors  for  small  numbers  we  are  to  draw  conclusions  as 
to  the  curve  of  presumptive  errors,  we  must  guess,  but  at  the  same  time  support  our  guess, 
partly  by  an  estimate  of  how  great  irregularities  we  may  expect  in  a  curve  of  actual  errors 
for  the  given  number,  partly  by  developing  our  feeling  for  the  form  of  regular  curves  of 
that  sort,  as  we  must  suppose  that  the  curves  of  presumptive  errors  will  be  very  regular. 
In  both  respects  we  must  get  some  practice,  but  this  is  easy  and  interesting. 

Without  feeling  tied  down  to  the  particular  points  that  determined  the  curve  of 
actual  errors,  we  shall  nevertheless  try  to  approach  them,  and  especially  not  allow  many 
large  deviations  on  the  same  side  to  come  together.  We  can  generally  regard  as  large 
deviations  (the  reason  why  will  be  mentioned  in  the  chapter  on  the  Theory  of  Probabilities) 

those  that  cause  greater  errors,   as   compared   with  the  absolute  frequency   of  the  result 

_  i         it 
in  question,  than  the  square  root  of  that  number  (more  exactly  yh  —   -,  where  h  is  the 

frequency  of  the  result,  n  the  number  of  all  repetitions).  But  even  deviations  two  or  three 
times  as  great  as  this  ought  not  always  to  be  avoided,  and  we  may  be  satisfied,  if  only 
one  third  of  the  deviations  of  the  determining  points  must  be  called  large.  We  may  use 
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the  word  "adjustment"  (graphical)  to  express  the  operation  by  which  a  curve  of  presumptive 
errors  is  determined.  (Comp.  §  64).  The  adjustment  is  called  an  over-adjustment,  if  we 
have  approached  too  near  to  some  imaginary  ideal,  but  if  we  have  kept  too  close  to  the 
curve  of  actual  errors,  then  the  curve  is  said  to  be  under-adjusted. 

Our  second  guide,  the  regularity  of  the  curve  of  errors,  is  as  an  sesthetical  notion 
of  a  somewhat  vague  kind.  The  continuity  of  the  curve  is  an  essential  condition,  but  it 
is  not  sufficient.  The  regularity  here  is  of  a  somewhat  different  kind  from  that  seen  in 
the  examples  of  simple,  continuous  curves  with  which  students  more  especially  become 
acquainted.  The  curves  of  errors  get  a  peculiar  stamp,  because  we  would  never  select  the 
essential  circumstances  of  the  observation  so  absurdly  that  the  deviations  could  become 
indefinitely  large.  Nor  would  we  without  necessity  retain  a  form  of  observation  which 
might  bring  about  discontinuity.  It  follows  that  to  the  abscissae  which  indicate  very  large 
deviations,  must  correspond  rapidly  decreasing  ordinates.  The  curve  of  errors  must  have 
the  axis  of  abscissae  as  an  asymptote,  both  to  the  right  and  the  left.  All  frequency  being 
positive,  where  the  curve  of  errors  deviates  from  the  axis  of  abscissae,  it  must  exclusively 
keep  on  the  positive  side  of  the  latter.  It  must  therefore  more  or  less  get  the  appearance 
of  a  bow,  with  the  axis  of  abscissae  for  the  string.  In  order  to  train  the  eye  for  the 
apprehension  of  this  sort  of  regularity,  we  recommend  the  study  of  figs.  2  &  3,  which 
represent  curves  of  errors  of  typical  forms,  exponential  and  binomial  (comp.  the  next  chapter, 
p.  16,  seqq.),  and  a  comparison  of  them  with  figures  which,  like  Nr.  1,  are  drawn  from 
actual  observations  without  any  adjustment. 

The  best  way  to  acquire  practice  in  drawing  curves  of  errors,  which  is  so  important 
that  no  student  ought  to  neglect  it,  may  be  to  select  a  series  of  observations,  for  which 
the  law  of  presumptive  errors  may  be  considered  as  known,  and  which  is  before  us  in 
tabular  form. 

We  commence  by  drawing  curves  of  actual  errors  for  the  whole  series  of  observa- 
tions; then  for  tolerably  large  groups  of  the  same,  and  lastly  for  small  groups  taken  at 
random  and  each  containing  only  a  few  observations.  On  each  drawing  we  draw  also, 
besides  the  curve  of  actual  errors,  another  one  of  the  presumptive  errors,  on  the  same 
scale,  so  that  the  abscissae  are  common,  and  the  ordinates  indicate  relative  frequencies  in 
proportion  to  the  same  unit  of  length  for  the  total  number.  The  proportions  ought  to  be 
chosen  so  that  the  whole  part  of  the  axis  of  abscissae  which  deviates  sensibly  from  the 
curve,  is  between  2  and  5  times  as  long  as  the  largest  ordinate  of  the  curve. 

Prepared  by  the  study  of  the  differences  between  the  curves,  we  pass  on  at  last 
to  the  construction  of  curves  of  presumptive  errors  immediately  from  the  scattered  points 
of  the  curve  which  correspond  to  the  observed  frequencies.  In  this  construction  we  must 
not  consider  ourselves  obliged  to  reproduce  the  curve  of  presumptive  errors  which  we  may 
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know  beforehand;  our  task  is  to  represent  the  observations  as  nearly  as  possible  by  means 
of  a  curve  which  is  as  smooth  and  regular  as  that  curve. 

The  following  table  of  500  results,  got  by  a  game  of  patience,   may  be  treated  in 
this  way  as  an  exercise. 
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The  law  of  presumptive  errors  here  given  is  not  the  direct  result  of  free-hand  con- 
struction; but  the  curve  so  got  has  been  improved  by  interpolation  of  the  logarithms  of 
its  statements  of  the  relative  frequencies,  together  with  the  formation  of  mean  numbers 
for  the  deviations,  a  proceeding  which  very  often  will  give  good  results,  but  which  is  not 
strictly  necessary.  By  this  we  can  also  determine  the  functional  law  of  errors  (Com p.  the 
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next  chapter).    The  equation  of  the  curve  is: 
Logy  =  2-0228 +  0-0030  (a?— 11)  —  0-6885  (*— 11)2+ 0-01515  (.*—  II)3  —  O001678(#—  II)4 

§  15.  By  the  study  of  many  curves  of  presumptive  errors,  and  especially  such  as 
represent  ideal  functional  laws  of  errors,  we  cannot  fail  to  get  the  impression  that  there 
exists  a  typical  form  of  curves  of  errors,  which  is  particularly  distinguished  by  symmetry. 
Familiarity  with  this  form  is  useful  for  the  construction  of  curves  of  presumptive  errors. 
But  we  must  not  expect  to  get  it  realised  in  all  cases.  For  this  reason  I  have  considered 
it  important  to  give,  alongside  of  the  typical  curves,  an  example  taken  from  real  observa- 
tions of  a  skew  -curve  of  errors,  which  in  consequence  of  its  marked  want  of  symmetry 
deviates  considerably  from  the  typical  form.  Fig.  4  shows  this  last  mentioned  law  of 
presumptive  errors. 

Deviation  from  the  typical  form  does  not  indicate  that  the  observations  are  not 
good.  But  it  may  become  so  glaring  that  we  are  forced  by  it  to  this  conclusion.  If,  for 
instance,  between  the  extreme  values  of  repetitions  —  abscissae  —  there  are  intervals  which 
are  as  free  from  finite  ordinates  as  the  space  beyond  the  extremes,  so  that  the  curve  of 
errors  is  divided  into  two  or  several  smaller  curves  of  errors  beside  one  another,  there  can 
scarcely  be  any  doubt  that  we  have  not  a  series  of  repetitions  proper,  but  a  combination 
of  several;  that  is  to  say,  different  methods  of  observation  have  been  used  and  the  results 
mixed  up  together.  In  such  cases  we  cannot  expect  that  the  law  of  large  numbers  will 
remain  in  force,  and  we  had  better,  therefore,  reject  such  observations,  if  we  cannot  retain 
them  by  tracing  out  the  essential  circumstances  which  distinguish  the  groups  of  the  series, 
but  have  been  overlooked. 

§  16.  When  a  curve  of  presumptive  errors  is  drawn,  we  can  measure  the  magnitude 
of  the  ordinate  for  any  given  abscissa;  so  far  then  we  know  the  law  of  errors  perfectly,  by 
means  of  the  curve  of  errors,  but  certainly  in  the  tabular  form  only,  with  all  its  copious- 
ness. Whether  we  can  advance  further  depends  on,  whether  we  succeed  in  interpolating  in 
the  table  so  found,  and  particularly  on,  whether  we  can,  either  from  the  table  or  direct  from 
the  curve  of  errors,  by  measurement  obtain  a  comparatively  small  number  of  constants,  by 
which  to  determine  the  special  peculiarities  of  the  curve. 

By  interpolating,  by  means  of  Newton's  formula,  the  logarithms  of  the  frequencies, 
or  by  drawing  the  curves  of  errors  with  the  logarithms  of  the  frequencies  as  ordinates, 
we  often  succeed,  as  above  mentioned,  in  giving  the  curve  the  form  of  a  parabola  of  low 
(and  always  even)  degree. 

Still  easier  is  it  to  make  use  of  the  circumstance  that  fairly  typical  curves  of  errors 
show  a  single  maximum  ordinate,  and  an  inflexion  on  each  side  of  it,  near  which  the 
curve  for  a  short  distance  is  almost  rectilinear.  By  measuring  the  co-ordinates  of  the 
maximum  point  and  of  the  points  of  inflexion,  we  shall  get  data  sufficient  to  enable  us  to 


15 

draw  a  curve  of  errors  which,  as  a  rule,  will  deviate  very  little  from  the  original.  All  this, 
however,  holds  good  only  of  the  curves  of  presumptive  errors.  With  the  actual  ones  we 
cannot  operate  in  this  way,  and  the  transition  from  the  latter  to  the  former  seems  in  the 
meantime  to  depend  on  the  eye's  sense  of  beauty. 


V.     FUNCTIONAL   LAWS   OF   ERRORS. 

$  17.  Laws  of  errors  may  be  represented  in  such  a  way  that  the  frequency  of 
the  results  of  repetitions  is  stated  as  a  mathematical  function  of  the  number,  or  numbers, 
expressing  the  results.  This  method  only  differs  from  that  of  curves  of  errors  in  the 
circumstance  that  the  curve  which  represents  the  errors  has  been  replaced  by  its  mathema- 
tical formula;  the  relationship  is  so  close  that  it  is  difficult,  when  we  speak  of  these  two 
methods,  to  maintain  a  strict  distinction  between  them. 

In  former  works  on  the  theory  of  observations  the  functional  law  of  errors  is  the 
principal  instrument.  Its  source  is  mathematical  speculation;  we  start  from  the  properties 
which  are  considered  essential  in  ideally  good  observations.  From  these  the  formula  for 
the  typical  functional  law  of  errors  is  deduced;  and  then  it  remains  to  determine  how 
to  make  computations  with  observations  in  order  to  obtain  the  most  favourable  or  most 
probable  results. 

Such  investigations  have  been  carried  through  with  a  high  degree  of  refinement; 
but  it  must  be  regretted  that  in  this  way  the  real  state  of  things  is  constantly  disregarded. 
The  study  of  the  curves  of  actual  errors  and  the  functional  forms  of  laws  of  actual  errors 
have  consequently  been  too  much  neglected. 

The  representation  of  functional  laws  of  errors,  whether  laws  of  actual  errors  or  laws 
of  presumptive  errors  founded  on  these,  must  necessarily  begin  with  a  table  of  the  results 
of  repetitions,  and  be  founded  on  interpolation  of  this  table.  We  may  here  be  content  to 
study  the  cases  in  which  the  arguments  (i.  e.  the  results  of  the  repetitions)  proceed  by 
constant  differences,  and  the  interpolated  function,  which  gives  the  frequency  of  the 
argument,  is  considered  as  the  functional  law  of  errors.  Here  the  only  difficulty  we  en- 
counter is  that  we  cannot  directly  employ  the  usual  Newtonian  formula  of  interpolation, 
as  this  supposes  that  the  function  is  an  integral  algebraic  one,  and  gives  infinite  values 
for  infinite  arguments,  whether  positive  or  negative,  whereas  here  the  frequency  of  these 
infinite  arguments  must  be  =  0.  We  must  therefore  employ  some  artifice,  and  an  obvious 
one  is  to  interpolate,  not  the  frequency  itself,  y,  but  its  reciprocal,  — .  This,  however,  turns 
out  to  be  inapplicable;  for  -r  will  often  become  infinite  for  finite  arguments,  and  will,  at 
any  rate,  increase  much  faster  than  any  integral  function  of  low  degree. 
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But,  as  we  have  already  said,  the  interpolation  generally  succeeds,  when  we  apply 
it  to  the  logarithm  of  the  frequency,  assuming  that 

Log  y  =  a  -}-  bx  -j-  cxz  -f-  ...-(-  gx  2|}, 

where  the  function  on  the  right  side  begins  with  the  lowest  powers  of  the  argument  x, 
and  ends  with  an  even  power  whose  coefficient  y  must  be  neyative.  Without  this  latter 
condition  the  computed  frequency, 

y    ^    lQa+bx  +  cxz+...  +  gx*nii  QJ 

would  again  become  infinitely  great  for  x  ==  -j-  oo  .  That  the  observed  frequency  is  often 
=  0,  and  its  logarithm  GO  like  —  ,  does  no  harm.  Of  course  we  must  leave  out 

these  frequencies  of  the  interpolation,  or  replace  them  by  very  small  finite  frequencies,  a 
few  of  which  it  may  become  necessary  to  select  arbitrarily.  As  a  rule  it  is  possible  to 
succeed  by  this  means.  In  order  to  represent  a  given  law  of  actual  errors  in  this  way,  we 
must,  according  to  the  rule  of  interpolation,  determine  the  coefficients  a,  6,  c,  ...</,  whose 
number  must  be  at  least  as  large  as  that  of  the  various  results  of  repetitions  with  which 
we  have  to  deal.  This  determination,  of  course,  is  a  troublesome  business. 

Here  also  we  may  suppose  that  the  law  of  presumptive  errors  is  simpler  than  that 
of  the  actual  errors.  And  though  this,  of  course,  does  not  imply  that  log  y  can  be  ex- 
pressed by  a  small  number  of  terms  containing  the  lowest  powers  of  #,  this  supposition, 
nevertheless,  is  so  obvious  that  it  must,  at  any  rate,  be  tried  before  any  other. 

§  18.    Among  these,  the  simplest  case,  namely  that  in  which  Log  y  is  a  function 

of  x  of  the  second  degree 

Logy  =  a-\-bx  —  e#2, 

gives  us  the  typical  form  for  the  functional  law  of  errors,  and  for  the  curve  of  errors,  or 
with  other  constants 

i  /x—m\v  ,    /x—m\" 

y  ==  farlH5)  =  Mo-0*17"!—)  ,  (2) 

where 


The  function  has  therefore  no  other  constants  than  those  which  may  be  interpreted 
as  unit  for  the  frequencies  /z,  and  as  zero  m  and  unit  n  for  the  observed  values;  the 
corresponding  typical  curve  of  errors  has  therefore  in  all  essentials  a  fixed  form. 

The  functional  form  of  the  typical  law  of  errors  has  applications  in  mathematics 
which  are  almost  as  important  as  those  of  the  exponential,  logarithmic,  and  trigonometrical 
functions.  In  the  theory  of  observations  its  importance  is  so  great  that,  though  it  has 
been  over-estimated  by  some  writers,  and  though  many  good  observations  show  presumptive 
as  well  as  actual  laws  of  errors  that  are  not  typical,  yet  every  student  must  make  himself 
perfectly  familiar  with  its  properties. 
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Kxpamlin.i:  th»>  iml<-x  we  get 

—  i-(*~m\*  -i(m\t      £?!        W 

e~~*\~ir)  —  0  7w  •  e~«f  •€"'*(*) ,  (3) 

so  that  the  general  function  resolves  itself  into  a  product  of  three  factors,  the  first  of  which 
is  constant,  the  second  an  ordinary  exponential  function,  while  the  third  remains  a  typical 
film  lional  law  of  errors.  Long  usage  reduces  this  form  to  e "*;  but  this  form  cannol  In- 
recommended.  In  the  majority  of  its  purely  mathematical  applications  e~mi  is  preferable, 
unless  (as  in  the  whole  theory  of  observations)  the  factor  in  the  index  is  to  be  preferred 

on  account  of  the  resulting  simplification  of  most  of  the  derived  formulae. 

_-!/.?  V 
The  differential  coefficients  of  e    2W   with  regard  to  x  are 

--f-\*  !/TV 

De    z\n)  =  — tt~2xe  "2\nj 


(4) 


The  law  of  the  numerical  coefficients  (products  of  odd  numbers  and  binomial 
numbers)  is  obvious.  The  general  expression  of  Z>'<?~~2W  can  be  got  from  a  comparison 
of  the  coefficients  to  ( — »»)•'  of  the  two  identical  series  for  equation  (3),  one  being  the  Taylor 
series,  the  other  the  product  of  eTTW  and  the  two  exponential  series  with  w2  and  m  as 
arguments.  It  can  also  be  induced  from  the  differential  equation 

Inversely,  we  obtain  for  the  products  of  the  typical  law  of  errors  by  powers  of  x 


-f- 


(5) 


the  numerical   coefficients   being  the  same  as  above  (4).    This  proposition  can  be  demon- 
strated by  the  identical  equation  n-*af+*<p  =  —  D(xry)-{-rxr-i<p. 

By  means   of  these  formulae  every   product  of  any   integral  rational  function   by 

3 


18 
exponential  functions  and  functional  typical  laws  of  errors  can  be  reduced  to  the  form 


(6) 


where 


and  thus  they  can  easily  be  differentiated  and  integrated.  Every  quadrature  of  this  form 
can  be  reduced  to 

_iAr  — wi\2  a          t/x—m\* 

f\  (x)  e~^(^r)  -[-  fa  (x)  J  e~J\~f~)  dx, 

where  fv(x)  and  f.>(x)  are  integral  rational  function^;  thus  a  very  large  class  of  problems 
can  be  solved  numerically  by  aid  of  the  following  table  of  the  typical  or  exponential 
functional  law  of  errors,  y  =  e~**\  together  with  the  table  of  its  integral  iwfe. 
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Here  »,  •  •   are,   each  of  them,  the  same  for  positive  and  negative  values 

dz  t     az* 

of  z;  the  other  columns  of  the  table  change  signs  with  z. 

The  interpolations  are  easily  worked  out  by  means  of  Taylor's  theorem: 


«  +  ...  .7) 

and 

^.C'^..    (8) 


The  typical  form  for  the  functional  law  of  errors  (2)  shows  that  the  frequency  is 
always  positive,  and  that  it  arranges  itself  symmetrically  about  the  value  x  =  w,  for  which 
the  frequency  has  its  maximum  value  y  =  h.  For  x^m^n  the  frequency  is  y  =  h  •  0-60653. 
The  corresponding  points  in  the  curve  of  errors  are  the  points  of  inflexion.  The  area 
between  the  curve  of  errors  and  the  axis  of  abscissae,  reckoned  from  the  middle  to  x  =  w^n, 
will  be  nh  •  0-85562  ;  and  as  the  whole  area  from  one  asymptote  to  the  other  is  nh  V'2x 
=  nh  •  2-50663,  only  nh  •  0-39769  of  it  falls  outside  either  of  the  inflexions,  consequently 
not  quite  that  sixth  part  (more  exactly  16  per  ct.)  which  is  the  foundation  of  the  rule, 
given  in  §  11,  as  to  the  limit  between  the  great,,  and  small  errors. 

The  above  table  shows  how  rapidly  the  function  of  the  typical  law  of  errors  de- 
creases toward  zero.  In  almost  all  practical  applications  of  the  theory  of  observations 
e~T**=  0,  if  only  z  >  5.  Theoretically  this  superior  assymptotical  character  of  the  function 
is  expressed  in  the  important  theorem  that,  for  z  =  i00,  not  only  e~~z*  itself  is  =  0 
but  also  all  its  differential  coefficients;  and  that,  furthermore,  all  products  of  this  function 
by  every  algebraic  integral  function  and  by  every  exponential  function,  and  all  the  differential 

quotients  of  these  products,  are  equal  to  zero. 

-+oo 

f   __!_  2  _ 

In  consequence  of  this  theorem,  the  integral  \e~~2*  dz  =  V2jr   can    be   computed 

I  —  oo 

as  the  sum  of  equidistant  values  of  e~  a**  multiplied  by  the  interval  of  the  arguments 
without  any  correction.  This  simple  method  of  computation  is  not  quite  correct,  the 
underlying  series  for  conversion  of  a  sum  into  an  integral  being  only  semiconvergent  in 
this  case;  for  very  large  intervals  the  error  can  be  easily  stated,  but  as  far  as  intervals 
of  one  unit  the  numbers  taken  out  of  our  table  are  not  sufficient  to  show  this  error. 

If  the  curve  of  errors  is  to  give  relative  frequency  directly,  the  total  area  must  be 
1  =  nhVZx',  h  consequently  ought  to  be  put  =  ~T^F* 

71  V  —  .  ». 

Problem  1.     Prove  that  every  product  of  typical  laws  of  errors  in   the  functional 

i  fx—  w\2 

form  =  he~~2\~>r)  ,  with  the  same  independent  variable  x,  is  itself  a  typical  law  of  errors. 
How  do  the  constants  h,  m,  and  n  change  in  such  a  multiplication? 
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Problem  2.     How   small   are   the  frequencies  of  errors  exceeding  2,  3,  or  4  times 
the  mean  error,  on  the  supposition  of  the  typical  law  of  errors? 
Problem  3.    To  find  the  values  of  the  definite  integrals 


Answer: 


sr  =  \xre    z\~irj  dx. 

V  —  00 

=  =  0  and  s2i  =  1  •  3  •  5  ...  (2*  —  1) 


§  19.  Nearly  related  to  the  typical  or  exponential  law  of  errors  in  functional  form 
are  the  binomial  functions,  which  are  known  from  the  coefficients  of  the  terms  of  the  nih 
power  of  a  binomial,  regarded  as  a  function  of  the  number  x  of  the  term. 


n 

0 

1 

2 

X 

3 

4 

5 

6 

7 

0 

1 

i 

1 

1 

2 

1 

2 

1 

3 

1 

3 

3 

1 

4 

1 

4 

6 

4 

1 

5 

1 

5 

10 

10 

5 

1 

6 

1 

6 

15 

20 

15 

6 

1 

7 

1 

7 

21 

35 

35 

21 

7 

1 

8 

1 

8 

28 

56 

70 

56 

28 

8 

9 

1 

9 

36 

84 

126 

126 

84 

36 

10 

1 

10 

45 

120 

210 

252 

210 

120 

11 

1 

11 

55 

165 

330 

462 

462 

330 

12 

1 

12 

66 

220 

495 

792 

924 

792 

13 

1 

13 

78 

286 

715 

1287 

1716 

1716 

14 

1 

14 

91 

364 

1001 

2002 

3003 

3432 

For  integral  values  of  the  argument  the  binomial  function  can  be  computed  directly 

by  the  formula 

r.  ,  ,  1.2-'3...n 

P'l(X)  ""  ~  Pn(1 


1-2.  ..x 


When  the  binomial  numbers  for  n  are  known,   those  for   n  +  1  are  easily  found 

by  the  formula 

/?„+,  (x)  r  /?„  (a;)  +  /?„  (x  -  1).  (10) 

By  substitution  according  to  (9)  we   easily  demonstrate   the  proposition   that,   for 


•Jl 

any  integral  values  of  /<,  /•,  and  t 


(11) 

which  means  that,  when  the  trinomial   (a  -f  b  4-  c)n  is  developed,  it  is  indifferent  whether 
we  consider  it  to  be  ((«  -f  b)  -f-  c)»  or  (a  +  (b  +  c))*. 

For  fractional  values  of  the  argument  x,  the  binomial  function  ftn(x)  can  be  taken 
in  an  infinity  of  different  ways,  for  instance  by 


This  formula  results  from  a  direct  application  of  Lagrange's  method  of  interpolation,  and 
leads  by  (10)  to  the  more  general  formula 

1-2...  n  smnx 


This  species  of  binomial  function  may  be  considered  the  simplest  possible,  and  has 
some  importance  in  pure  mathematics;  but  as  an  expression  of  frequencies  of  observed 
values,  or  as  a  law  of  errors,  it  is  inadmissible  because,  for  x  >  n  or  a;  negative,  it  gives 
negative  values  alternating  with  positive  values  periodically. 

This,  however,  may  be  remedied.  As  /90  (x)  has  no  other  values  than  0  and  1, 
when  x  is  integral,  we  can  put  for  instance 


by  (10)  then 


7CX 


(13) 


Here  the  values  of  the  binomial  function  are  constantly  positive  or  0.  But  this 
form  is  cumbersome ;  and  although  for  x  =  co  the  function  and  its  principal  coefficients 
are  =  0,  this  property  is  lost  here,  when  we  multiply  by  integral  algebraic  or  by  exponen- 
tial functions. 

These  unfavourable  circumstances  Detract  greatly  from  the  merits  of  the  binomial 
functions  as  expressions  for  continuous  laws  of  errors. 

When,  on  the  contrary,  the  observations  correspond  only  to  integral  values  of  the 
argument,  the  original  binomial  functions  are  most  valuable  means  for  treating  them.  That 
ftn(x)  =  0,  if  x>n  or  negative,  is  then  of  great  importance.  But  this  case  must  be  referred 
to  special  investigations. 

§  20.  To  represent  non-typical  laws  of  errors  in  functional  form  we  have  now 
the  choice  between  at  least  three  different  plans: 


22 
1)    the  formula  (1)  or 


2)     the  products  of  integral  algebraic  functions  by  a  typical  function  or  (6) 


3)    a  sum  of  several  typical  functions 

y  =  2fo-W*T)  .  (14) 

This  account  of  the  more  prominent  among  the  functional  forms,  which  we  have  at  our 
disposal  for  the  representation  of  laws  of  errors,  may  prove  that  we  certainly  possess  good 
instruments,  by  means  of  which  we  can  even  in  more  than  one  form  find  general  series 
adapted  for  the  representation  of  laws  of  errors.  We  do  not  want  forms  for  the  series, 
required  in  theoretical  speculations  upon  laws  of  errors;  nor  is  the  exact  representation  of 
the  actual  frequencies  more  than  reasonably  difficult.  If  anything,  we  have  too  many  forms 
and  too  few  means  of  estimating  their  value  correctly. 

As  to  the  important  transition  from  laws  of  actual  errors  to  those  of  presumptive 
errors,  the  functional  form  of  the  law  leaves  us  quite  uncertain.  The  convergency  of  the 
series  is  too  irregular,  and  cannot  in  the  least  be  foreseen. 

We  ask  in  vain  for  a  fixed  rule,  by  which  we  can  select  the  most  important  and 
trustworthy  forms  with  limited  numbers  of  constants,  to  be  used  in  predictions.  And  even 
if  we  should  have  decided  to  use  only  the  typical  form  by  the  laws  of  presumptive  errors, 
we  still  lack  a  method  by  which  we  can  compute  its  constants.  The  answer,  that  the 
"adjustment"  of  the  law  of  errors  must  be  made  by  the  "method  of  least  squares",  may 
not  be  given  till  we  have  attained  a  satisfactory  proof  of  that  method;  and  the  attempts 
that  have  been  made  to  deduce  it  by  speculations  on  the  functional  laws  of  errors  must, 
I  think,  all  be  regarded  as  failures. 

VI.     LAWS   OF   ERRORS 

EXPRESSED  BY  SYMMETRICAL  FUNCTIONS. 

§  21.  All  constants  in  a  functional  law  of  errors,  every  general  property  of  a 
curve  of  errors  or,  generally,  of  a  law  of  numerical  errors,  must  be  symmetrical  functions 
of  the  several  results  of  the  repetitions,  i.  e.  functions  which  are  not  altered  by  inter- 
changing two  or  more  of  the  results.  For,  as  all  the  values  found  by  the  repetitions 
correspond  to  the  same  essential  circumstances,  no  interchanging  whatever  can  have  any 
influence  on  the  law  of  errors.  Conversely,  any  symmetrical  function  of  the  values  of  the 
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observations  will  represent  some  property  or  other  of  the  law  of  errors.  And  we  must  be 
able  to  express  the  whole  law  of  errors  itself  by  every  such  collection  of  symmetrical 
functions,  by  which  every  property  of  the  law  of  errors  can  be  expressed  as  unambiguously 
as  by  the  very  values  found  by  the  repetitions. 

We  have  such  a  collection  in  the  coefficients  of  that  equation  of  the  wlh  degree, 
whose  roots  are  the  n  observed  values.  For  if  we  know  these  coefficients,  and  solve  the 
equation,  we  get  an  unambiguous  determination  of  all  the  values  resulting  from  the  repe- 
titions, i.e.  the  law  of  errors.  But  other  collections  also  fulfil  the  same  requirements;  the 
essential  thing  is  that  the  n  symmetrical  functions  are  rational  and  integral,  and  that  one 
of  them  has  each  of  the  degrees  1,  2  . . .  «,  and  that  none  of  them  can  be  deduced  from 
the  others. 

The  collection  of  this  sort  that  is  easiest  to  compute,  is  the  sums  of  the  powers. 
With  the  observed  values 

ox,  o.2,  o3,  ...  on 
we  have 

s0  =  o°  -j-  o\  -}-  •  •  -f  On   =  n 


_    0\  -f  0\  -|-  .  .  -f  Ol 

V  I  **  I  I  ' 

=    0 .   -f-  0     -4-  .  .  H—  Ot 

II         a       I 


(15) 


o 

and  the  fractions  -     may  also  be  employed  as  an  expression  for  the  law  of  errors;  it  is 

it> 

only  important  to  reduce  the  observations  to  a  suitable  zero  which  must  be  an  average 
value  of  ol  ...on-,  for  if  the  differences  between  the  observations  are  small,  as  compared 
with  their  differences  from  the  average,  then 


may  become  practically  identical,  and  therefore  unable  to  express  more  than  one  property 
of  the  law  of  errors. 

From  a  well  known  theorem  of  the  theory  of  symmetrical  functions,  the  equations 
1  -f-  ava>  -f-  a2<y2  -j-  .  .  .  =  (1  —  Oj<y)  (1  —  02a>)  ...  {1  —  onco) 


which  are  identical  with  regard  to  every  value  of  o>,  we  learn  that  the  sum  of  the  powers 
sr  can  be  computed  without  ambiguity,  if  we  know  the  coefficients  <ir  of  the  equation, 
whose  roots  are  the  n  observations;  and  vice  versa,  by  differentiating  the  last  equation 
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with  regard  to  to,  and  equating  the  coefficients  we  get 

0  ==  al  -f-  s, 

0  =  2«2  -(-  «,«!  4-  .<?2 


(16) 


0  =  nan  4-  ara_i  sl  -f  . .  .  4-  a,  sn_,  -|-  *„ 

from  which  the  coefficients  an  are  unambiguously  and  very  easily  computed,  when  the  sn 
are  directly  calculated. 

§  22.  But  from  the  sums  of  powers  we  can  easily  compute  also  another  service- 
able collection  of  symmetrical  functions,  which  for  brevity  we  shall  call  the  half-invariants. 

Starting  from  the  sums  of  powers  sr,  these  can  be  defined  as  p.l,  ^/2,  ^3,  by  the 
equation 

which  we  suppose  identical  with  regard  to  r. 
As  sr  =  2V,  this  can  be  written 


•„  nr+y '+„.«!"*  as) 

By  developing  the  first  term  of  (17)  as  2krrr  and  equating  the  coefficients  of  each 

a 

power  of  r,  we  get  each  --  expressed  as  a  function  of  jul  ...fir: 


(19) 


Taking  the  logarithms  of  (17)  we  get 


|i  '      l» 


|3< 


^    ,   ^^,         } 

»0  !1  '  s«  j?  •  *n  !» 


(20) 


and  hence 


=  (*,  «2  —  38,  s,  «0 


(21) 


The  general  law   of  the  relation  between   the  ^  and  .s  is  more   easily   understood 
through  the  equations 
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«    — 


-f  2/^s,  -f 


(22) 


the  numerical  coefficients  are  those  of  the  binomial  theorem.    These  equations  can 
be  demonstrated  by  ditferentiation  of  (17)  with  regard  to  r,  the  resulting  equation 

(23) 


being  satisfied  for  all  values  of  r  by  (22). 

These  half-invariants  possess  several  remarkable  properties.    From  (18)  we  get 

.s0e£T       II  T       "  =  e  T+...+el  (24) 

consequently  any  transformation   o'  =  o  +  c,   any   change  of  the  zero  of  all  observations 
ol  ...r>B,  affects  only  /^  in   the  same  manner,   but  leaves  /^2,  /^3,  //4,  ...  unaltered;  any 
change  of  the  unit  of  all  observations  can  be  compensated  by  the  reciprocal  change  of  the 
unit  of  r,  and  becomes  therefore  indifferent  to  //2r2,  //3r8,  . . . 
Not  only  the  ratios 

SO  Q 

1          &2  *>n 

T"'      V'    *  "  T~ 

S(}  *0  SQ 

but  also  the  half-invariants   have  the  property  which  is  so  important  in   a  law  of  errors, 
of  remaining  unchanged  when  the  whole  series  of  repetitions  is  repeated  unchanged. 

We  have  seen  that  the  typical  character  of  a  law  of  errors  reveals  itself  in  the 
elegant  functional  form 

.  ,-!(*-?)'. 


Now  we  shall  see  that  it  is  fully  as  easy  to  recognize  the  typical  laws  of  errors  by  means 
of  their  half-invariants.  Here  the  criterion  is  that  pr  =  0  if  r>3,  while  f^^m  and 
//2  =  Ma.  This  remarkable  proposition  has  originally  led  me  to  prefer  the  half-invariants 
to  every  other  system  of  symmetrical  functions;  it  is  easily  demonstrated  by  means  of  (5), 
if  we  take  m  for  the  zero  of  the  observations. 

We  begin  by  forming  the  sums  of  powers  .<?r  of  that   law  of  errors  where  the  fre- 
quency of  an  observed  x  is  proportional  to  y>  (x)  =  e~~z\^)  ;  as  this  law  is  continuous 

we  get 

C+  °° 
sr  =  \xr<p(x}dx. 

v —  oo 

4 
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For  every  differential  coefficient  Dm<p(x)  we  have 

° 

my(x)'dx  =  Z)m~V(cc)  —  Dm^tp(~  oo)  =  0, 

0 

consequently  we  learn  from  (5)  that  s2r+i  =  0,  but 

s2  =  1  •  w2  s0 
*4  =  1  •  3  •  n*  s0 
SR  =  1  •  3  -5  -n6  sn 


(compare  problem  3,  §  18).    Now   the  half-invariants  can  be  found  by  (22)  or  by  (17).    If 
we  use  (22)  we  remark  that  s2r  ==  w.2  (2r  —  l)s2»--2;  then  writing  for  (22) 

=  0 

=0 


=    0 

s6  —  5^2s4  =  //jSB  -f-  10/^8sa  +  10//4.<?2  +  5//5S!  +^fi,<?o  ==  0 

we  see  that  the  solution  is  //2  =  w2  and  (JLI  =//8  =^4  =  ...  =  0. 
By  (17)  we  get 

••  2  ,M!  + 


Equating  the  coefficients  of  rr  we  get  here  also  ;JLI  =  0  =  w,  //2  =  w2,  /^r  ==  0 
if  r  ^  3. 

If  we  wish  to  demonstrate  this  important  proposition  without  change  of  the  zero, 
and  without  the  use  of  the  equations  (3)  whose  general  demonstration  is  somewhat  diffi- 
cult, we  can  commence  by  the  lemma  that,  for  each  integral  and  positive  value  of  r,  and 
also  for  r  =•-=  0,  -we  have  for  the  typical  law  of  errors 

Sr-t-i  —  msr  -j-rw2sr-i. 

_  1  /*—  m\* 
The  function  </>(x)  =  »2afe   T\  *  /  is  equal  to  zero  both  for  #=00  and  for  x  =  —  :c; 

if  we  now  between  these  limits  integrate  its  differential  equation 

dtl>(x} 

_l_>_^  =  (rn2xr't  —  (x  — 

we  get 

0  =  —  sr+i  -\-  msr  -\-  r  n 
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irhere 


I ,.,   »,  I 
p«~7l~5  )  </./•. 


If  we  now  from  (22)  suMniH,  term  liy  term,  the  equations 

I  '' 


.S' 


it  is  obvious  that  /;,  —  m  =  0,  //2  =-  «*,  ^8  =//4  =  ...  =  0. 

By  computation  of  /^  and  ^.2  we  find  consequently,  in  the  simplest  way,  the 
constants  of  a  typical  law  of  errors. 

If  the  law  of  errors  deviates  only  a  little  from  the  typical  form,  /^3,  //4,  etc.,  will 
also,  all  of  them,  be  relatively  small  numbers;  and  each  of  them  may  be  either  positive 
or  negative. 

On  the  whole,  a  law  of  errors  can  be  determined  without  ambiguity  by  the  values 
;/,,  ^2,  .  .  .  //r,  r  being  the  number  of  repetitions.  From  any  such  ps  we  can  compute 
the  sums  of  the  powers  s  unambiguously,  and  from  these  again  the  coefficients  of  the 
equation  whose  roots  are  the  observed  values. 

But  for  real  laws  of  errors  it  is  a  necessary  condition  that  no  imaginary  root 
can  be  admitted.  If  an  infinite  number  of  repetitions  is  considered,  the  equation  ceases 
to  be  algebraic,  and  then  the  convergency  of  the  series  necessary  for  its  solution  is  a 
further  condition. 


§  23.     The  mean  value  ^  =  ^-  =  Q|+Qa+    -^   is  always  greater  than  the 

SQ  fl 

least,  less  than  the  greatest  of  the  observed  values  ox,  o2,  ...  on;  under  typical  circum- 
stances we  shall  find  almost  the  same  number  of  greater  and  less  values  of  the  observations. 
The  majority  of  them  lie  rather  near  to  //,;  only  few  very  distant  from  it.  The  mean 
value  is  the  simplest  representative  of  what  is  common  in  a  series  of  values  found  by 
repetition;  its  application  as  such  is  most  likely  exceedingly  old,  and  marks  in  the  history 
of  science  the  first  trace  of  a  theory  of  observations. 

The  mean  deviation,  whose  square  is  =//2*  measures  the  magnitude  of  the  devia- 
tions, the  uncertainty  of  the  repeated  actual  observations.  The  square  of  the  mean  deviation 
is  the  mean  of  the  squares  of  the  deviations  of  the  several  observations  from  their  mean 
value.  By  addition  of 

4* 
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(OH 
we  get 


and  as  ^t  =  — 


The  computation  of  /z2  by  this  formula  will  often  be  easier  than  by  the  equation 
(21),  because  s2  in  the  latter  must  frequently  be  computed  with  more  figures.  There  is 
however  a  middle  course,  which  is  often  to  be  preferred  to  either  of  these  methods  of  com- 
putation. As  a  change  in  the  zero  of  the  observations  involves  the  same  increase  of  every 
o  and  of  //n  it  will,  according  to  (24),  have  no  influence  at  all  on  /JLZ.  We  select  therefore 
as  zero  a  convenient,  round  number,  c,  very  near  p^  and  by  reference  to  this  zero  the 
observed  values  are  transformed  to 

0(    =    Ol  —  C  ,     0[    =    02  -  C,    ...    o'n   =    On—  C. 

When  s\  and  s'2  indicate  the  sums  of  the  transformed  observations,  and  p\  ==f*i  —  c,  then 

.  2(o  —  c) 
we  have  «  .  =  c  -\  —       —  '  and 

~*  in 


(26) 


We  have  still  to  mention  a  theorem  concerning  the  mean  deviation,  which,  though 
not  useful  for  computation,  is  useful  for  the  comprehension  and  further  development  of  the 
idea:  The  square  of  the  mean  deviation  ^2  is  equal  to  the  sum  of  squares  of  the  difference 
between  each  observed  value  and  each  of  the  others,  divided  by  twice  the  square  of  the 
number.  The  said  squares  are: 

(°i  —  °i)*'  K—  °i)2'   ••••   (°»  —  °i)2» 
(ot  —  o2)2,  (o2  —  o2)2,    ....   (o»--o2)2, 


(0,   —  Ow)2,    (02  —  0B)2,     (On  —  On)*  ; 

developing  each  of  these  by  the  formula  (om  —  on)2  =  o?n  —  2omore  +  ol ,  and  first  adding  each 
column  separately,  we  find  the  sums 
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—  2s,o,  -f 


*X  —  2.S-.0,,  -f   K, 

and  the  sum  of  these 

808,  -  28,8,  -f  *,*«,    -=    2(S0*,  —  Sj), 

consequently, 

2'2>r-0,)"  -  2«;/v  (27) 

The  mean  deviation  is  greater  than  the  least,  less  than  the  greatest  of  the  deviations  of 
the  values  of  repetitions  from  the  mean  number,  and  less  than  ]/\  of  the  greatest  deviation 
between  two  observed  values. 

As  to  the  higher  half-invariants  it  may  here  be  enough  to  state  that  they  indicate 
various  sorts  of  deviations  from  the  typical  form.  Skew  curves  of  errors  are  indicated  by 
the  //2r+t  being  different  from  zero,  peaked  or  flattened  (divided)  forms  respectively  by 
positive  or  negative  values  of  ^4r,  and  inversely  by  /ur+2. 

For  these  higher  half-invariants  we  shall  propose  no  special  names.  But  we  have 
already  introduced  double  names  "relative  frequency"  and  "probability"  in  order  to  accen- 
tuate the  distinction  between  the  laws  of  actual  errors  and  those  of  presumptive  errors, 
and  the  same  we  ought  to  do  for  the  half-invariants.  In  what  follows  we  shall  indicate 
the  half-invariants  in  laws  of  presumptive  errors  by  the  signs  Ar  instead  of  pr  which  will 
be  reserved  for  laws  of  actual  errors,  particularly  when  we  shall  treat  of  the  transition 
from  laws  of  actual  errors  to  those  of  presumptive  ones.  For  special  reasons,  to  be 
explained  later  on,  the  name  mean  value  can  be  used  without  confusion  both  for  ^jt  and 
y^,  for  actual  as  well  as  for  presumptive  means;  but  instead  of  "mean  deviation"  we  say 
"mean  error",  when  we  speak  of  laws  of  presumptive  errors.  Thus,  if  n  —  s0  , 


is  called  the  square  of  the  mean  error. 

In  speculations  upon  ideal  laws  of  errors,  when  the  laws  are  supposed  to  be  con- 
tinuous or  to  relate  to  infinite  numbers  of  observations,  this  distinction  is  of  course 
insignificant. 

Examples: 

1.  Professor  Jul.  Thomsen  found  for  the  constant  of  a  calorimeter,  in  experiments 
with  pure  water,  in  seven  repetitions,  the  values 

2649,  2647,  2645,  2653,  2653,  2646,  2649. 

If  we  take  here  2650  as  zero,  we  read  the  observations  as 

-1,  -3,  -5,  +3,  +  3,  -4,  -1 
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so  that 

s'0  =  7,   s\  =  —  8,    and    s'a   =  70; 

consequently 

fjtl  -=•  2650  —  4  ==  2649. 


,,  _ 

-"2     -  '     7 

The  mean  deviation  is  consequently  -4-  3. 

2.  In  an   alternative  experiment  the  result  is   either  uyes",  which  counts   1,   or 
"no",  which  counts  0.     Out  of  m  -\-  n  repetitions  the  m  have  given    "yes",   the  n  "no". 
What  then  is  the  expression  for  the  law  of  errors  in  half-invariants? 

m  mn  inn  (n  —  m)  mn(m*  —  4mn4-nz) 

Answer:  u,  =  -       —  ,  «a  =  -.  —  .  —  -r^  ,  ,u3  =  —,  —  H  —  r?r»  f**  =  ~     —  ?  -  i  —  • 

m  4-  n    t^i        (in  -f-w)      '  (m-\-  n)d      l  *  (m  -f-  n)* 

3.  Determine  the  law  of  errors,  in  half-invariants,  of  a  voting  in  which  a  voters 
have  voted  for  a  motion  (-)-  1),  c  against  (—  1),  while  b  have  not  voted  (0),   and  examine 
what  values  for  «,  b,  and  c  give  the  nearest  approximation  to  the  typical  form. 

a  —  c  ab  4-  4ca  -\-  be  _  (c  —  a)  (ab  -\-  Sea  -j-  6c  —  b'2) 

- 


_  ((a  +  c-)  (o  -f  6  -f  c)  —  4  (a  —  c)'2)  (a  +  b  -f  c-)  (2a  —  &  -f  2c)  -f  6  (a  —  r;}4 
^4   '  ~~(a  _j_  ftq_  C)4- 

Disregarding  the  case  when  the  vote  is  unanimous,  the  double  condition  ^3=//4 
=  0  is  only  satisfied  when  one  sixth  of  the  votes  is  for,  another  sixth  against,  while  two 

thirds  do  not  give  their  votes.     If  ^a  is  to  be  =  0,   without   a  being  =6-,  62  —  b(a^-c) 

(a  _  c    \  2 
-  1  ,   which  does  not  disappear  unless 
a  -j-o-f-c/ 

two  of  the  numbers  «,  i,  and  c,  and  consequently  //2,  are  =  0. 

4.     Six  repetitions  give  the  quite  symmetrical  and  almost  typical  law  of  errors, 
fj.l  =  0,  //2  =  •£,  ^3  =^4  =//5  =  0,  but  ^g  ==  —  f.    What  are  the  observed  values? 

Answer:   —  1,  0,  0,  0,  0,  -f  1. 


VII.     RELATIONS  BETWEEN  FUNCTIONAL  LAWS  OF  ERRORS 

AND  HALF-INVARIANTS. 

§  24.  The  multiplicity  of  forms  of  the  laws  of  errors  makes  it  impossible  to  write 
a  Theory  of  Observations  in  a  short  manner.  For  though  these  forms  are  of  very  different 
value,  none  of  them  can  be  considered  as  absolutely  superior  to  the  others.  The  functional 
form  which  has  been  universally  emploj'ed  hitherto,  and  by  the  most  prominent  writers,  has 
in  my  opinion  proved  insufficient.  I  shall  here  endeavour  to  replace  it  by  the  half-invariants. 
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But  even  if  I  should  succeed  in  this  endeavour,  I  am  sure  that  not  only  the  functional 
laws  of  errors,  but  even  the  curves  of  errors  and  the  tables  of  frequency  are  too  important 
ami  natural  to  be  put  completely  aside  without  detriment. 

Moreover,  in  proposing  a  new  plan  for  this  theory,  I  have  felt  it  my  duty  to  explain 
;is  precisely  and  completely  as  possible  its  relation  to  the  old  and  commonly  known  methods. 
I  therefore  consider  it  a  matter  of  great  importance  that  even  the  half-invariants,  in  tlmir 
very  definition,  present  a  natural  transition  to  the  frequencies  and  to  the  functional  law 
of  errors. 

If  in  the  equation  (18) 


ne 


o,r 
=  e     -f  .  .  .  -j-  « 


some  of  the  o,-s  are  exactly  repeated,  it  is  of  course  understood  that  the  term  e°tr  must 
be  counted  not  once  but  as  often  as  o,  is  repeated.  Consequently,  this  definition  of  the 
half-invariants  may,  without  any  change  of  sense,  be  written 

Mi-.-Uj-J./il-J, 

" 


=  2y>  (o,)  e°ir  (28) 

where  the  frequencies  ^(o.)  are  given   in   the   form   of  the  functional  law  of  errors.     For 
continuous  laws  of  errors  the  definition  must  be  written 

^T4-^lr».4-^Ts  +          p+0° 

e\L  '.\>(o)do  =     <p(o)eordo.  (29) 


Ts  +          p+0°  f+oc 

'.\y>(o)do  =  \<p(o)eordo. 

J  —  oo  J  —  oo 


Thus,  if  we  know  the  functional  law  of  errors  and  if  we  can  perform  the  integrations,  the 
half-invariants  may  be  found.  If,  inversely,  we  know  the  ^,,  then  it  may  be  possible  also 
to  determine  the  functional  law  of  errors  tp  (o). 

Example  1.    Let  y(o)  be  a  sum  of  typical  functional  laws  of  errors, 


<p(o]  a 
.+• 

then  \<f>(o)clo  ==  K27r2W?»  and 

J—  00 


+  '  ,)»  —  2n,2ro) 


C        x   or  j  vz   f 

\(p(o}eordo  =  2hA 

J  —  oo  J 


and  consequently 
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o 

By  aid  of  the  formulae  (19)  that  express  --  as  functions  of  the  ^   (or  p)  it  is  not 

so 
difficult  to  compute  the  principal  half-invariants.    The  inverse  problem,  to  compute  the 

m,-,  Hi,  and  hi  by  means  of  given  half-invariants  is  very  difficult,  .as  it  results  in  equa- 
tions of  a  high  degree,  even  if  only  a  sum  of  two  typical  functional  laws  of  errors  is 
in  question. 

Example  2.    What  are  the  half-invariants  of  a  pure  binomial  law  of  errors?    The 
observation  r  being  repeated  ftn(r)  times,  we  write 


consequently 


Here  the  right  hand  side  of  the  equation  can  be  developed  by  the  aid  of  Bernoullian 
numbers  into  a  series  containing  only  the  even  powers  of  r,  consequently 

//• 

ft  i  =  —  -Q-    and    «2r+i  =  0,     (r>0) 


2 

further 

_n_  _n_  n_  17  _  31 

Example  3.    What   are  the  half-invariants  of  a  complete  binomial  law  of  errors 
(the  complete  terms  of  (p-\-q)*)?     Here 

7rT  +  7^7"2+^8T3  +  ... 


From  this  we  obtain  by  differentiation  with  regard  to  r 

I     tl*          I      ^  3      2      '     /*4      '!  W2*T 


by  further  differentiation 


putting  T  =  0  we  get 


p»+i  -\ — ^nr  + . . .  = 


npq 
Pi  — 


npq(p-q) 
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'/' 
npq 


: 


Example  4.  A  law  of  presumptive  errors  is  given  by  its  half-invariants  forming 
a  geometrical  progression,  jlr  =  bar.  Determine  the  several  observations  and  their  frequencies. 
Here  the  left  hand  side  of  the  equation  (18)  is 


aT          («r)»         (aT)B 

+      -          • 


but  this  is  =  s0e-6|l  -f  6e°T-(-^e2or-f  -jg-e^-f-...*  and  has  also  the  form  of  the  right 

side  of  (18).    Thus  the  observed  values  are  0,  a,  2a,  3a,  ...  and  the  relative  frequency  of 

If 
ra  is  -^  =  <f(r).    This  law  of  errors  is  nearly  related  to  the  binomial  law,  which  can  be 

considered  as  a  product  of  two  factors  of  this  kind, 


|r      |n— r  \n  ^nv  ' 

It   is   perhaps   superior    to    the    binomial   law   as   a   representative   of  some   skew   laws 
of  errors. 

Example  5.  A  law  of  errors  has  the  peculiarity  that  all  half-invariants  of  odd 
order  are  =  0,  while  all  even  half-invariants  are  equal  to  each  other,  /I2r  =  2a.  Show 
that  all  the  observations  must  be  integral  numbers,  and  that  for  the  relative  frequencies 


-*> 

'-  ^  ~~l    "• 


Example  6.  Determine  the  half-invariants  of  the  law  of  presumptive  errors  for  the 
irrational  values  in  the  table  of  a  function,  in  whose  computation  fractions  under  *  have 
been  rejected  and  those  over  £  replaced  by  1  : 

A2r+l    =    0,      X2     =     j-?  '      A4     =    -- 


§  25.    As  a  most  general  functional  form  of  a  continuous  law  of  errors  we  have 
proposed  (6) 


where  y>  (x) 
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Now  it  is  a  very  remarkable  thing  that  we  can  express  the  half-invariants  without 
any  ambiguity  as  functions  of  the  coefficients  &,-,  and  vice  versa. 
By  (29)  we  get 


»  ,»+»  ,  , 

•  \      nT"  /7  /    \  \     T\       /\l 


where  s0  =  nk0]/2x.    By  means  of  the  lemma 


which  is  easily  demonstrated  for   any  <p  (o)  by  differentiating  with   regard  to  o  only  ,   we 
have  in  this  particular  case,  where  y>  (o)  and  every  D'  <p(o)  is  =  0,  if  o  =  +  oo  , 

SI  10—  m\2 
e°T  Dr  e~^(^T)  do  = 

-—oo 

Consequently,  the  relation  between  the  half-invariants  on  one  side  and   the  coefficients  k 
of  the  general  functional  law  of  errors  on  the  other,  is 

1    T-L       *    T*  -L       '    T3  J- 

+  "  (30) 


If  we  write  here  /t  =  ^t  —  m  and  /«,  =  ^2  —  w2,  the  computation  of  one  set  of 
constants  by  the  other  can,  according  to  (17),  be  made  by  the  formulae  (19)  and  (21).  We 
substitute  only  in  these  the  &,-  for  the  s,-,  and  A'  or  X  for  ju. 

It  will  be  seen  that  the  constants  m  and  w,  and  the  special  typical  law  of  errors 
to  which  they  belong,  are  generally  superfluous.  This  superfluity  in  our  transformation 
may  be  useful  in  special  cases  for  reasons  of  convergency,  but  in  general  it  must  be  con- 
sidered a  source  of  vagueness,  and  the  constants  must  be  fixed  arbitrarily. 

It  is  easiest  and  most  natural  to  put 

m  =  /I,   and   w2  =  ^2  » 
In  this  case  we  get  kl  =0,  k2  =0,  &3  =&0^3,  k4  =  fcn^4,  k5  —  &0>}5,  and  further 


The  law  of  the  coefficients  is  explained  by  writing  the  right  side  of  equation  (30) 


fco  «wt 
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Expressed  by  half-invariants  in  this  manner  the  explicit  form   of  eq  nation  (6)  is 


(31) 


24UJ 


VIII.     LAWS  OF  ERRORS  OF  FUNCTIONS  OF  OBSERVATIONS. 

§  26.  There  is  nothing  inconsistent  with  our  definitions  in  speaking  of  laws  of  errors 
relating  to  any  group  of  quantities  which,  though  not  obtained  by  repeated  observations, 
have  the  like  property,  namely,  that  repeated  estimations  of  a  single  thing  give  rise,  owing 
to  errors  of  one  kind  or  other,  to  multiple  and  slightly  differing  results  which  are  prima 
facie  equally  valid.  The  various  forms  of  laws  of  actual  errors  are  indeed  only  summary 
expressions  for  such  multiplicity;  and  the  transition  to  the  law  of  presumptive  errors 
requires,  besides  this,  only  that  the  multiplicity  is  caused  by  fixed  but  unknown  circum- 
stances, and  that  the  values  must  be  mutually  independent  in  that  sense  that  none  of  the 
circumstances  have  connected  some  repetitions  to  others  in  a  manner  which  cannot  be 
common  to  all.  Compare  §  24,  Example  6. 

It  is,  consequently,  not  difficult  to  define  the  law  of  errors  for  a  function  of  one 
single  observation.  Provided  only  that  the  function  is  univocal,  we  can  from  each  of  the 
observed  values  on  o>£  ...  on  determine  the  corresponding  value  of  the  function,  and 

A'i),  fM,  ...A*.) 

will  then  be  the  series  of  repetitions  in  the  law  of  errors  of  the  function,  and  can  be 
treated  quite  like  observations. 

With  respect,  however,  to  those  forms  of  laws  of  errors  which  make  use  of  the 
idea  of  frequency  (probability)  we  must  make  one  little  reservation.  Even  though  0,  and 
ok  are  different,  we  can  have  /"(o,)  =  f(ok),  and  in  this  case  the  frequencies  must  evidently 
be  added  together.  Here,  however,  we  need  only  just  mention  this,  and  remark  that  the 
laws  of  errors  when  expressed  by  half-invariants  or  other  symmetrical  functions  are  not 
influenced  by  it. 

Otherwise  the  frequency  is  the  same  for  f(ot)  as  for  o,-,  and  therefore  also  the 
probability.  The  ordinates  of  the  curves  of  errors  are  not  changed  by  observations  with 
discontinuous  values;  but  the  abscissa  o,  is  replaced  by  /"(o,-),  and  likewise  the  argument 
in  the  functional  law  of  errors.  In  continuous  functions,  on  the  other  hand,  it  is  the 
areas  between  corresponding  ordinates  which  must  remain  unchanged. 
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In  the  form  of  symmetrical  functions  the  law  of  errors  of  functions  of  observations 
may  be  computed,  and  not  only  when  we  know  all  the  several  observed  values,  and  can  there- 
fore compute,  for  each  of  them,  the  corresponding  value  of  the  function,  and  at  last  the 
symmetrical  functions  of  the  latter.  In  many  and  important  cases  it  is  sufficient  if  we 
know  the  symmetrical  functions  of  the  observations,  as  we  can  compute  the  symmetrical 
functions  of  the  functions  directly  from  these.  For  instance,  if  f(o)  =  o2 ;  for  then  the 
sums  of  the  powers  s'n  of  the  squares  are  also  sums  of  the  powers  sm  of  the  observations, 
if  only  constantly  m  =  2n ;  s'0  —  s0,  s' l  =  s2 ,  s'2  =  s4,  etc. 

§  27.  The  principal  thing  is  here  a  proposition  as  to  laws  of  errors  of  the  linear 
functions  by  half-invariants. 

[t  is  almost  self-evident  that  if  a'  =  ao-\-b 


„—  (32) 

etc. 
p'r   =  »>  (r  >  1) 

For  the  linear  functions  can  always  be  considered  as  produced  by  the  change  of 
both  zero  and  unity  of  the  observations  (Compare  (24)). 

However  special  the  linear  function  ao  -j-  b  may  be,  we  always  in  practice  manage 
to  get  on  with  the  formula  (32).  That  we  can  succeed  in  this  is  owing  to  a  happy 
circumstance,  the  very  same  as,  in  numerical  solutions  of  the  problems  of  exact  mathematics, 
brings  it  about  that  we  are  but  rarely,  in  the  neighbourhood  of  equal  roots,  compelled  to 
employ  the  formulas  for  the  solution  of  other  equations  than  those  of  the  first  degree. 
Here  we  are  favoured  by  the  fact  that  we  may  suppose  the  errors  in  good  observations 
to  be  small,  so  small  -  -  to  speak  more  exactly  -  -  that  we  may  generally  in  repetitions 
for  each  series  of  observations  ol^  o2,  . . .  o»  assign  a  number  c,  so  near  them  all  that 
the  squares  and  products  and  higher  powers  of  the  differences 

0  j  C)    t/2  C^    •  •  •    Oft  6 

without  any  perceptible  error  may  be  left  out  of  consideration  in  computing  the  function: 
i.  e.,  these  differences  are  treated  like  differentials.  The  differential  calculus  gives  a  definite 
method,  in  such  circumstances,  for  transforming  any  function  f(d)  into  a  linear  one 

f(o)  - 
The  law  of  errors  then  becomes 
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But  also  by  quite  elementary  means  and  easy  artifices  we  may  often  transform 
functions   into   others  of  linear  form.     If  for   instance  f(o)  =  — ,  then  we  write 

c-(o-c)          1         1 

.-(o-c). 


c  -f  (o  —  c)       c2  —  (o  —  cY 
and  the  law  of  errors  is  then 

*"  (I)  =  T  -  ^><°)- 


§  28.  -With  respect  to  functions  of  two  or  more  observed  quantities  we  may  also, 
in  case  of  repetitions,  speak  of  laws  of  errors,  only  we  must  define  more  closely  what  we 
are  to  understand  by  repetitions.  For  then  another  consideration  comes  in,  which  was  out 
of  the  question  in  the  simpler  case.  It  is  still  necessary  for  the  idea  of  the  law  of  errors 
of  f(o,  o')  that  we  should  have,  for  each  of  the  observed  quantities  o  and  o',  a  series  of 
statements  which  severally  may  be  looked  upon  as  repetitions: 

0,,     02'      .....     °m 


But  here  this  is  not  sufficient.  Now  it  makes  a  difference  if,  among  the  special 
circumstances  by  o  and  o',  there  are  or  are  not  such  as  are  common  to  observations  of  the 
different  series.  We  want  a  technical  expression  for  this.  Here  it  is  not  appropriate  only 
to  speak  of  observations  which  are,  respectively,  dependent  on  one  another  or  independent; 
we  are  led  to  mistake  the  partial  dependence  of  observations  for  the  functional  dependence 
of  exact  quantities.  I  shall  propose  to  designate  these  particular  interdependences  of 
repetitions  of  different  observations  by  the  word  "bond",  which  presumably  cannot  cause 
any  misunderstanding. 

Among  the  repetitions  of  a  single  observation,  no  other  bonds  must  be  found  than 
such  as  equally  bind  all  the  repetitions  together,  and  consequently  belong  to  the  pecularities 
of  the  method.  But  while,  for  instance,  several  pieces  cast  in  the  same  mould  may  be 
fair  repetitions  of  one  another,  and  likewise  one  dimension  measured  once  on  each  piece, 
two  or  more  dimensions  measured  on  the  same  piece  must  generally  be  supposed  to  be 
bound  together.  And  thus  there  may  easily  exist  bonds  which,  by  community  in  a  cir- 
cumstance, as  here  the  particularities  in  the  several  castings,  bind  some  or  all  the  repe- 
titions of  a  series  each  to  its  repetition  of  another  observation;  and  if  observations  thus 
connected  are  to  enter  into  the  same  calculation,  we  must  generally  take  these  bonds  into 
account.  This,  as  a  rule,  can  only  be  done  by  proposing  a  theory  or  hypothesis  as  to  the 
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mathematical  dependence  between  the  observed  objects  and  their  common  circumstance, 
and  whether  the  number  which  expresses  this  is  known  from  observation  or  quite  unknown, 
the  right  treatment  falls  under  those  methods  of  adjustment  which  will  be  mentioned 
later  on. 

It  is  then  in  a  few  special  cases  only  that  we  can  determine  laws  of  errors  for 
functions  of  two  or  more  observed  quantities,  in  ways  analogous  to  what  holds  good  of  a 
single  observation  and  its  functions. 

If  the  observations  o,  o',  o"  .  .  .,  which  are  to  enter  into  the  calculation  of 
f(o,  o',  0",  .  .  .),  are  repeated  in  such  a  way  that,  in  general,  o,-,  o\,  o"<  ...  of  the  fth 
repetition  are  connected  by  a  common  circumstance,  the  same  for  each  «',  but  otherwise 
without  any  other  bonds,  we  can  for  each  i  compute  a  value  of  the  function  yt  = 
f(0i,  o'i,  o"  ',  .  .  .),  and  laws  of  errors  can  be  determined  for  this,  in  just  the  same  way  as 
for  o  separately.  To  do  so  we  need  no  knowledge  at  all  of  the  special  nature  of  the  bonds. 

§  29.  If,  on  the  contrary,  there  is  no  bond  at  all  between  the  repetitions  of  the 
observations  o,  o',  o",  ...  —  and  this  is  the  principal  case  to  which  we  must  try  to  reduce 
the  others  -  -  then  we  must,  in  order  to  represent  all  the  equally  valid  values  of  y  = 
f(o,  o',  o",  .  .  .),  herein  combine  every  observed  value  for  o  with  every  one  for  o',  for  o", 
etc.,  and  all  such  values  of  y  must  be  treated  analogously  to  the  simple  repetitions  of  one 
single  observed  quantity.  But  while  it  may  here  easily  become  too  great  a  task  to  com- 
pute y  for  each  of  the  numerous  combinations,  we  shall  in  this  case  be  able  to  compute 
y's  law  of  errors  by  means  of  the  laws  of  errors  for  o,  o',  o"  .  .  . 

Concerning  this  a  number  of  propositions  might  be  laid  down;  but  one  of  them 
is  of  special  importance  and  will  be  almost  sufficient  for  us  in  what  follows,  viz.,  that 
which  teaches  us  to  determine  the  law  of  errors  for  the  sum  0  of  the  observed  quantities 
o  and  o'. 

If  the  law  of  errors  is  given  in  the  form  of  relative  frequencies  or  probabilities, 
<p(o)  for  o  and  </>(o')  for  o',  then  it  is  obvious  that  the  product  <p(o}(p(o'}  must  be  the  fre- 
quency of  the  special  sum  o  -f-  o'. 

In  the  calculus  of  probabilities  we  shall  consider  this  form  more  closely,  and  there 
some  cases  of  bound  observations  will  find  their  solution;  here  we  shall  confine  ourselves 
to  the  treatment  of  the  said  case  with  half-invariants. 

If  o  occurs  with  the  observed  values 


0,1  o. 
and  o'  with 


then  by  the  mn  repetitions  of  the  operation   0  =  o4-o'  we  get: 
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Om  +  0\  ,    Om  -j-  O',  »     Oro  +  0J,« 

Indicating  by  Afr  the  half-invariants  of  the  sum  0  =  o-f-0',  we  get  by  (18) 

where  m  and  n  are  the  numbers  of  repetitions  of  o  and  o'.    Consequently,  if  (ir  represent 
the  half-invariants  of  o,  and  pr  of  o',  we  get 

g  ]i  '  '    __      g  |1  2  '  '    g  \i 

and  finally 

(34) 


Employing  the  equation  (17)  instead  of  (18)  we  can  also  obtain  fairly  simple 
expressions  for  the  sums  of  powers  of  (o-\-o')  analogous  to  the  binomial  formula.  But  the 
extreme  simplicity  of  (34)  renders  the  half-invariants  unrivalled  as  the  most  suitable  sym- 
metrical functions  and  the  most  powerful  instrument  of  the  theory  of  observations. 

More  generally,  for  every  linear  function  of  observations  not  connected  by  any  bond, 

0  =  a  -f-  bo  -f-  co'  -j-  . . .  do'", 
we  obtain  in  the  same  manner  and  by  (32) 

Mt  (o)  =  62//2  -\-  c'u'j  -|-  . . .  -f-  d*u" 


r  >  1. 


(35) 


When  the  errors  of  observation  are  sufficiently  small,  we  shall  also  here  generally 
be  able  to  give  the  most  different  functions  a  linear  form.  In  consequence  of  this,  the 
propositions  (34)  and  (35)  acquire  an  almost  universal  importance,  and  afford  nearly  the 
whole  necessary  foundation  for  the  theory  of  the  laws  of  errors  of  functions. 

Example  1.  Determine  the  square  of  the  mean  error  for  differences  of  the  w'th 
order  of  equidistant  tabular  values,  between  which  there  is  no  bond,  the  square  of  the 
mean  error  for  every  value  being  =  /12. 
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=  ^2(o4  —  4og-f-6o2 — 4o1-|-o0)  =  70^2 

.2_     6_    10    14       4rc  — 2 

1  '  2  '  3  '  4  '      ~^T^' 

Example  2.  By  the  observation  of  a  meridional  transit  we  observe  two  quantities, 
viz.  the  time,  t,  when  a  star  is  covered  behind  a  thread,  and  the  distance,  /",  from  the 
meridian  at  that  instant.  But  as  it  may  be  assumed  that  the  time  and  the  distance  are 
not  connected  by  a  bond,  and  as  the  speed  of  the  star  is  constant  and  proportional  to  the 
known  value  sin  p  (p  =  polar  distance),  we  always  state  the  observation  by  the  one  quan- 
tity, the  time  when  the  very  meridian  is  passed,  which  we  compute  by  the  formula  o  = 

The  mean  error  is 

j       I  sC\      2       i/l       I       f*  AC  OP  2  /vj   J       |T  I 

A  Q  It/  /      —  —      AO   I*' I       I       v'vOCL'       tJ  A  o   I/    !• 

Example  3.  A  scale  is  constructed  by  making  marks  on  it  at  regular  intervals, 
in  such  a  way  that  the  square  of  the  mean  error  on  each  interval  is  =  ^2. 

To  measure  the  distance  between  two  objects,  we  determine  the  distance  of  each 
object  from  the  nearest  mark,  the  square  of  the  mean  error  of  this  observation  being  =  /'2. 
How  great  is  the  mean  error  in  a  measurement,  by  which  there  are  n  intervals  between 

the  marks  we  use? 

^2  (length)  =  nXt  4-  2/a . 

Example  4.  Two  points  are  supposed  to  be  determined  by  bond-free  and  equally 
good  (^2  =  1)  measurements  of  their  rectangular  co-ordinates.  The  errors  being  small  in 
proportion  to  the  distance,  how  great  is  the  mean  error  in  the  distance  J? 

Example  5.  Under  the  same  suppositions,  what  is  the  mean  error  in  the  inclina- 
tion to  the  «-axis? 


Example  6.  Having  three  points  in  a  plane  determined  in  the  same  manner  by 
their  rectangular  co-ordinates  (xl^y1)^  (x^y^)^  (#3,y3),  find  the  mean  error  of  the  angle 
at  the  point  (x^y^) 

v       j?  +  j>  +  4 
x'('          ~JfJT 

Jn  J2,  J3  being  the  sides  of  the  triangle;  Jj  opposite  to  (x^.y^). 
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K\araples  7  and  8.  Find  the  mean  errors  in  determinations  of  the  areas  of  a 
triangle  and  a  plane  quadrangle. 

>(,  (triangle)  —  J  (  A\  +  J|  +  JJ)  ;    J,  (quadrangle)  —  |  (  J'  +  J'\. 

\i,»      *,«/ 

§  30.  Non-linear  functions  of  more  than  one  argument  present  very  great  difficulties. 
Even  for  integral  rational  functions  no  general  expression  for  the  law  of  errors  can  be  found. 
Nevertheless,  even  in  this  case  it  is  possible  to  indicate  a  method  for  computing  the  half- 
invariants  of  the  function  by  means  of  those  of  the  arguments.  To  do  so  it  seems  indis- 
pensable to  transform  the  laws  of  errors  into  the  form  of  systems  of  sums  of  powers.  If 
0  ==  f(o,  o',...om)  be  integral  and  rational,  both  it  and  its  powers  Or  can  be  written  as 
sums  of  terms  of  the  standard  form  £koa  •  ob  .  .  .  o<'")d,  and  for  every  such  term  the  sum 
resulting  from  the  combination  of  all  repetitions  is  ksa  •  sb  .  .  .  s^p  (including  the  cases 
where  a  or  b  or  d  may  be  =  0),  s^  being  the  sum  of  all  cth  powers  of  the  repetitions  of 
o(').  Thus  if  Sr  indicates  the  sum  of  the  rth  powers  of  the  function  0,  we  get 

Sr  =  2ksa  •  sb  .  .  .  SdW). 

Of  course,  this  operation  is  only  practicable  in  the  very  simplest  cases. 

Example  1.  Determine  the  mean  value  and  mean  deviation  of  the  product  oo'  =  0 
of  two  observations  without  bonds.  Here  S0  =  sns'0  and  generally  Sr  =  srs'r,  consequently 
the  mean  value  Ml  =^,^'1  and 

•    M',  =  /£„//,  +  /i,  X2  4-  ft'lfi\. 
MA  already  takes  the  cumbersome  form 


Example  2.  Express  exactly  by  the  half-invariants  of  the  co-ordinates  the  mean 
value  and  the  mean  deviation  of  the  square  of  the  distance  r2  =  x*  -f-i/2,  if  a?  and  y  are 
observed  without  bonds.  Here 


S2  (r*)  ==  st  (x)  s0  (y)  -f  2s2  (x)  st  (y)  +  s0  (x)  st  (y) 
and 


§  31.    The  most  important  application  of  proposition   (35)  is  certainly  the  deter- 
mination of  the  law  of  errors  of  the  mean  value  itself.     The  mean  value 

J_ 
n 

6 
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is,  we  know,  a  linear  function  of  the  observed  values,  and  we  may  treat  the  law  of  errors 
for  /nl  according  to  the  said  proposition,  not  only  where  we  look  upon  on  ...  on  as  per- 
fectly unconnected,  but  also  where  we  assume  that  they  result  from  repetitions  made 
according  to  the  same  method.  For,  just  like  such  repetitions,  ot,  ...  on  must  not  have 
any  other  circumstances  in  common  as  connecting  bonds  than  such  as  bind  them  all  and 
characterise  the  method. 

As  the  law  of  presumptive  errors  of  ol  is  just  the  same  as  for  o2  ...om,  with  the 
known  half-invariants  yjn  ^2,  ...  /lr  ...,  we  get  according  to  (35) 

1 
tv»tf        ju  v  i          •  ~r    ii          i  ' 


^aC«l)    =  '—. 2-0*2  +  •••   +   ^2)    =    ~^2 
III  Ili- 


(37) 
""•'''         m'f  *  "    '  "'          til    " 

and  in  general 


While,  consequently,  the  presumptive  mean  of  a  mean  value  for  m  repetitions  is 

the  presumptive  mean  itself,  the  mean  error  on  the  mean  value  ul   is  reduced  to   -    -  of 

Vm 

the  mean  error  on  the  single  observation.  When  the  number  m  is  large,  the  formation 
of  mean  values  consequently  reduces  the  uncertainty  considerably;  the  reduction,  however, 
is  proportionally  greater  with  small  than  with  large  numbers.  While  already  4  repetitions 
bring  down  the  uncertainty  to  half  of  the  original,  100  repetitions  are  necessary  in  order 
to  add  one  significant  figure,  and  a  million  to  add  3  figures  to  those  due  to  the  single 
observation. 

The  higher  half-invariants  of  /«1  are  reduced  still  more.  If  the  ^3,  ^4,  etc.,  of 
the  single  observation  are  so  large  that  the  law  of  errors  cannot  be  called  typical,  no  very 
great  numbers  of  m  will  be  necessary  to  realise  the  conditions  ^3(/!*t)  —  0  =  ^4(/*i)  with 
an  approximation  that  is  sufficient  in  practice.  It  ought  to  be  observed  that  this  reduction 
is  not  only  absolute,  but  it  holds  good  also  in  relation  to  the  corresponding  power  of  the 
mean  error  1/>12(^1)2;  for  (37)  gives 


which,  for  instance  when  m  =  4,  shows  that  the  deviation  of  ^3  from  the  typical  form 
which  appears  by  means  of  only  4  repetitions,  is  halved;  that  of  ^4  is  divided  by  4,  that 
of  /1  5  is  divided  by  8,  etc.  This  shows  clearly  the  reason  why  we  attach  great  importance 
to  the  typical  form  for  the  law  of  errors  and  make  arrangements  to  abide  by  it  in  practice. 
For  it  appears  now  that  we  possess  in  the  formation  of  mean  values  a  means  of  making 
the  laws  of  errors  typical,  even  where  they  were  not  so  originally.  Therefore  the  standard 
rule  for  all  practical  observations  is  this  :  Take  care  not  to  neglect  any  opportunities  of 
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repeating  observations  and  parts  of  observations,  so  that  you  can  directly  form  the  mean 
values  which  should  be  substituted  for  the  observed  results;  and  this  is  to  be  done  espe- 
cially in  the  case  of  observations  of  a  novel  character,  or  with  peculiarities  which  lead  us 
to  doubt  whether  the  law  of  errors  will  be  typical. 

This  remarkable  property  is  peculiar,  however,  not  to  the  mean  only,  but  also, 
though  with  less  certainty,  to  any  linear  function  of  several  observations,  provided  only 
the  coefficient  of  any  single  term  is  not  so  great  relatively  to  the  corresponding  deviation 
from  the  typical  form  that  it  throws  all  the  other  terms  into  the  shade.  From  (35)  it  is 
seen  that,  if  the  laws  of  errors  of  all  the  observations  o,  o',  ...  o<m)  are  typical,  the  law 
of  errors  for  any  of  their  linear  functions  will  be  typical  too.  And  if  the  laws  of  errors 
are  not  typical,  then  that  of  the  linear  function  will  deviate  relatively  less  than  any  of  the 
observations  o,  o',  ...  om. 

To  avoid  unnecessary  complication  we  represent  two  terms  of  the  linear  function 
simply  by  o  and  o'.  The  deviation  from  the  typical  zero,  which  appears  in  the  rth  half- 
invariants  (/•  >  2),  measured  by  the  corresponding  power  of  the  mean  error,  will  be  less 
for  0  =  o-\-o'  than  for  the  most  discrepant  of  the  terms  o  and  o'. 

The  inequation 


says  only  that,  if  the  laws  of  errors  for  o  and  o'  deviate  unequally  from  the  typical  form, 
it  is  the  law  of  errors  for  o  that  deviates  most.     But  this  involves 


(fj>\Tr) 

or  more  briefly 


Tr  >  *V 

where  T  is  positive,  r  >  2. 

When  we  introduce  a  positive  quantity   [7,  so  that 

2*  =*  U*  >  R\ 

it  is  evident  that  (Z7-J-1)*  >r  (R-\-\y,  and  it  is  easily   demonstrated  that  (T-f-l)r> 
(P-f-1)'. 

Remembering  that  x  -f  x~l  3>   2,   if  x  >  0,  we  get  by  the  binomial  formula 


^  U+ 
Consequently 


or 


6* 
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and 


but  this  is  the  proposition  we  have  asserted,  for   the  extension  to   any   number  of  terms 
causes  no  difficulty. 

But  if  it  thus  becomes  a  general  law  that  the  law  of  errors  of  linear  functions 
must  more  or  less  approach  the  typical  form,  the  same  must  hold  good  also  of  all  mode- 
rately complex  observations,  such  as  those  whose  errors  arise  from  a  considerable  number 
of  sources.  The  expression  "source  of  errors"  is  employed  to  indicate  circumstances  which 
undeniably  influence  the  result,  but  which  we  have  been  obliged  to  pass  over  as  unessential. 
If  we  imagined  these  circumstances  transferred  to  the  class  of  essential  circumstances,  and 
substantiated  by  subordinate  observations,  that  which  is  now  counted  an  observation  would 
occur  as  a  function,  into  which  the  subordinate  observations  enter  as  independent  variables; 
and  as  we  may  assume,  in  the  case  of  good  observations,  that  the  influence  of  each  single 
source  of  errors  is  small,  this  function  may  be  regarded  as  linear.  The  approximation  to 
typical  form  which  its  law  of  errors  would  thus  show,  if  we  knew  the  laws  of  errors  of 
the  sources  of  error,  cannot  be  lost,  simply  because  we,  by  passing  them  over  as  unessen- 
tial, must  consider  the  sources  of  error  in  the  compound  observation  as  unknown.  More- 
over, we  may  take  it  for  granted  that,  in  systematically  arranged  observations,  every  such 
source  of  error  as  might  dominate  the  rest  will  be  the  object  of  special  investigation  and, 
if  necessary,  will  be  included  among  the  essential  circumstances  or  removed  by  corrective 
calculations.  The  result  then  is  that  great  deviations  from  the  typical  form  of  the  law  of 
errors  are  rare  in  practice. 

§  32.  It  is  of  interest,  of  course,  also  to  acquire  knowledge  of  the  laws  of  errors 
for  the  determinations  of  ^2  and  the  higher  half-invariants  as  functions  of  a  given  number 
of  repeated  observations. 

Here  the  method  indicated  in  §  30  must  be  applied.  But  though  the  symmetry 
of  these  functions  and  the  identity  of  the  laws  of  presumptive  errors  for  on  o2,  ...  om 
afford  very  essential  simplifications,  still  that  method  is  too  difficult.  Not  even  for  ^2  have 
I  discovered  the  general  law  of  errors.  In  my  "Almindelig  lagttagelseslcere"  ,  Kebenhavn 
1889,  I  have  published  tables  up  to  the  eighth  degree  of  products  of  the  sums  of  powers 
sp  sq  .  .  .,  expressed  by  sums  of  terms  of  the  form  o',  o'j,  o"*;  these  are  here  directly  appli- 
cable. In  W.  Fiedler:  "Elemente  der  neueren  Geometric  und  der  Algebra  der  bindren 
Formen",  Leipzig  1862,  tables  up  to  the  10th  degree  will  be  found.  Their  use  is  more 
difficult,  because  they  require  the  preliminary  transformation  of  the  sf  to  the  coefficients 
ap  of  the  rational  equations  §  21.  There  are  such  tables  also  in  the  Algebra  by  Meyer 
Hirsch,  and  Cayley  has  given  others  in  the  Philosophical  Transactions  1857  (Vol.  147, 
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p.  489).    1  have  computed  the  four  principal  half-invariants  of  ^2: 

maX>,(ft^)  =  (m  —  l)*Xt  4  2/w  (»*  —  1)  X] 

48m2(m  —  I)  XI 

in1  X4(f*z)  =  (m  —  l)4/"8  4  24m  (m  —  \)'dX6Xz  4~  32m  (m  —  I)2  (m  —  2)^6/3  4 

4  8m  (m  —  1)  (4m2  —  9m  4  6)X]  4  144m2  (m  —  \)*AtA\  4 
4  96m2  (m  —  1)  (m  —  2)X\Xt  4-  48m3  (m  —  1)  X't. 

Here  m  is  the  number  of  repetitions. 

Of  //3  and  fjtt  only  the  mean  values  and  the  mean  errors  have  been  found: 

m2/*,(^3)  =  (m  —  1)  (m  —  2)<}3  , 


and 


(38) 


+  6m2  (m  —  1)  (m  —  2)/l3,  ; 


;^4)  =  =  (m  —  1)  (mz  ._  6/w  4  6)A\  —  6m  (m  —  1)  XI 

)/  •(  \  o    /        «>  /"*  l       /^  \  o   l  I 

=  (m  —  I)2  (m2    -  bm  -j-  o)2x8  4 

4  8m  (m  —  1)  (m2  —  6m  4  6)  (2m2  —  15m  4 

4  18m  (m  —  1)  (m  —  2)  (m  —  4)  (m2  —  6m  4  6)/"5/i3  4 

4  2m  (m  —  1)  (17w4  —  204m3  4  852m2  —  1404m  4  828)x\5  4 

4  24m2  (m  —  1)  (3m3  —  38m2  4  150m  — 138)/14  XI  4 

4  144m 2  (m  —  1)  (m  —  2)  (m  —  4)  (m  —  5)  X\X^  4 

4  24m3  (m  —  l)(m2—  6m  4  24)^. 

Further  I  know  only  that 

m4/\(//6)  .-  (m  — l)(m  — 2){(m2  — 12m412)>(5— 60mx~2>l3}, 

m'X^fjtt)  ==  (m  — l)(m*— 30m34150m2— 240m4120)/l6 - 

-  30m  (m  —  1)  (7m2  —  36m  4  36)/l4  /" 2  - 

-  60m (m  —  1)  (m  —  2)  (3m  —  8)  XI  - 

-  60m2  (m  —  l)(m  —  6M;, 

m6/}^)  =  (m— l)(m  — 2)(m4  — 60m34420m2— 720m4360)yi7 - 

-  630m (m  —  1)  (m  —  2)  (m2  —  8m  4  8)/l5  Jlg  - 

-  210m  (m  —  1)  (m  —  2)  (7m2  —  4Sm-\- 60)X,  Xt  - 
—  1260m2  (m  —  1)  (m  —  2)  (m  - 


(40) 


(41) 


(42) 


(43) 
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/y.s)  =  (m  —  l)(m6  —  126m5  +  1806m4—  8400m3  -f  16800m2  —  15120m  +  5040)/l    - 

-  56m  (m  —  1)  (31wi4  —540m3  +  2340m2  —  3600m  +  1800)/16/12  - 

-  1680m  (m  —  1)  (m  —  2)  (3m3  —  40m2  -f  120m  —  96)>15/13  - 

-  70m  (m  —  1)  (49m4  —  720m3  -f  3168m2  —  5400m  -f  3240)^  - 

-  840m2  (m  —  1)  (7m3  —  150m2  -f  576m  —  540);  4  *l  - 

-  10080m2  (m  —  1)  (m  —  2)  (m2  —  18m  -f  40)^  „  - 

-  840m3  (m  —  1)  (m2  —  30m  -f  90)  %  .  (44) 

Some  ^/s  of  products  of  the  ^2,  //3,  and  ^4  present  in  general  the  same  charac- 
teristics as  the  above  formulae.    The  most  prominent  of  these  characteristics  are: 

1)  It  is  easily  explained  that  ^  is  only  to  be  found  in  the  equation  >(1  (^J  =  y^; 
indeed  no  other  half-  in  variant  than  the  mean  value  can  depend  on  the  zero  of  the  obser- 
vations.  In  my  computations  this  characteristic  property  has  afforded  a  system  of  multiple 
checks  of  the  correctness  of  the  above  results. 

2)  All  mean  ^  (fjtr)  are  functions  of  the  0th  degree  with  regard  to  m,  all  squares 
of  mean  errors  ^2(//r)  are  of  the  (—  l)st  degree,   and  generally   each  As(/jtr)  is  a  function 
of  the  (1  —  s)th  degree,  in  perfect  accordance  with  the  law  of  large  numbers. 

3)  The  factor  m  —  1  appears  universally  as  a  necessary   factor  of  As(fjtr),  if  only 
/•>!.     If  r  is  an  odd  number,  even  the  factor  m  —  2  appears,   and,   likewise,   if  r  is  an 
even  number,  this  factor  is  constantly  found  in  every  term  that  is  multiplied  by  one  or 
more  ^'s  with  odd  indices.  No  obliquity  of  the  law  of  errors  can  occur  unless  at  least  three 
repetitions  are  under  consideration. 

4)  Many    particulars    indicate    these     functions    as    compounds     of     factorials 
(m  —  1)  (m  —  2)  ...  (m  —  r)  and  powers  of  m. 

If,  supposing  the  presumed  law  of  errors  to  be  typical,  we  put  ^3  =  /}4  =  ...  =  0, 
then  some  further  inductions  can  be  made.    In  this  case  the  law  of  errors  of  ^/2  may  be 


T    ,          (/-*2)   ,,    , 


/  9)      - 

'•—  (l-  ^ 


'•—     l-  j   2    =\<p(o)e*do.  (45) 

As  to  the  squares  of  mean  errors  of  fir  we  get  under  the  same  supposition: 


(46) 

indicating  that  generally 


4? 

This  proposition  is  of  very  great  interest.  If  we  have  a  number  m  of  repetitions 
at  our  disposal  for  the  computation  of  a  law  of  actual  errors,  then  it  will  be  seen  that 
the  relative  mean  errors  of  //n  //2,  //3  .../*,.  are  by  no  means  uniform,  but  increase  with 
the  index  r.  If  m  is  large  enough  to  give  us  /^E  precisely  and  //2  fairly  well,  then  pa  and 
yu4  can  be  only  approximately  indicated;  and  the  higher  half-invariants  are  only  to  be 
guessed,  if  the  repetitions  are  not  counted  by  thousands  or  millions. 

As  all  numerical  coefficients  in  ^2(/ur)  increase  with  r,  almost  in  the  same  degree 
as  the  coefficients  1,  2,  G,  and  24  of  ^,  we  must  presume  that  the  law  of  increasing 
uncertainty  of  the  half-invariants  has  a  general  character. 

We  have  hitherto  been  justified  in  speaking  of  the  principal  half-invariants  as  the 
complete  collection  of  the  /n/s  or  /lr's  with  the  lowest  indices,  considering  a  complete  series 
of  the  first  m  half-invariants  to  be  necessary  to  an  unambiguous  determination  of  a  law  of 
errors  for  m  repetitions. 

We  now  accept  that  principle  as  a  system  of  relative  rank  of  the  half-invariants 
with  increasing  uncertainty  and  consequently  with  a  decreasing  importance  of  the  half- 
invariants  with  higher  indices. 

We  need  scarcely  say  that  there  are  some  special  exceptions  to  this  rule.  For 
instance  if  ^4  =  —  k\ ,  as  in  alternative  experiments  with  equal  chances  for  and  against 
(pitch  and  toss),  then  Az(fiz)  is  reduced  to  =  -  ^~AJ,  which  is  only  of  the  (— 2)nd  order. 

§  33.  Now  we  can  undertake  to  solve  the  main  problem  of  the  theory  of  obser- 
vations, the  transition  from  laws  of  actual  errors  to  those  of  presumptive  errors.  Indeed 
this  problem  is  not  a  mathematical  one,  but  it  is  eminently  practical.  To  reason  from  the 
actual  state  of  a  finite  number  of  observations  to  the  law  governing  infinitely  numerous 
presumed  repetitions  is  an  evident  trespass;  and  it  is  a  mere  attempt  at  prophecy  to 
predict,  by  means  of  a  law  of  presumptive  errors,  the  results  of  future  observations. 

The  struggle  for  life,  however,  compels  us  to  consult  the  oracles.  But  the  modern 
oracles  must  be  scientific;  particularly  when  they  are  asked  about  numbers  and  quantities, 
mathematical  science  does  not  renounce  its  right  of  criticism.  We  claim  that  confusion 
of  ideas  and  every  ambiguous  use  of  words  must  be  carefully  avoided;  and  the  necessary 
act  of  will  must  be  restrained  to  the  acceptation  of  fixed  principles,  which  must  agree 
with  the  law  of  large  numbers. 

It  is  hardly  possible  to  propose  more  satisfactory  principles  than  the  following: 

The  mean  value  of  all  available  repetitions  can  be  taken  directly,  without  any 
change,  as  an  approximation  to  the  presumptive  mean. 

If  only  one  observation  without  repetition  is  known,  it  must  itself,  consequently, 
be  considered  an  approximation  to  the  presumptive  mean  value. 

The  solitary  value  of  any  symmetrical  and  univocal  function  of  repeated  observations 
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must  in  the  same  way,  as  an  isolated  observation,  be  considered  the  presumptive  mean  of 
this  function,  for  instance  //r  =  ^t  (//,.)• 

Thus,  from  the  equations  37—41,  we  get  by  m  repetitions: 


=  m-1^'2 


L,  = 

3     === 
/U     = 


~ 


(m  —  1)  (m2  — Qn 

4  '  60 


(47) 


(m  —  1)  (m  —  2)  (m2  - 
as  to  >16,    ^7,    ^8   it  is   preferable  to   use  the  equations  42 — 44  themselves,  putting  only 

Inversely,  if  the  presumptive  law  of  errors  is  known  in  this  way,  or  by  adoption 
of  any  theory  or  hypothesis,  we  predict  the  future  observations,  or  functions  of  observations, 
principally  by  computing  their  presumptive  mean  values.  These  predictions  however,  though 
univocal,  are  never  to  be  considered  as  exact  values,  but  only  as  the  first  and  most  impor- 
tant terms  of  laws  of  errors. 

If  necessary,  we  complete  our  predictions  with  the  mean  errors  and  higher  half- 
invariants,  computed  for  the  predicted  functions  of  observations  by  the  presumed  law  of 
errors,  which  itself  belongs  to  the  single  observations.  These  supplements  may  often  be 
useful,  nay  necessary,  for  the  correct  interpretation  of  the  prediction.  The  ancient  oracles 
did  not  release  the  questioner  from  thinking  and  from  responsibility,  nor  do  the  modern 
ones;  yet  there  is  a  difference  in  the  manner.  If  the  crossing  of  a  desert  is  calculated  to 
last  20  days,  with  a  mean  error  of  one  day,  then  you  would  be  very  unwise,  to  be  sure, 
if  you  provided  for  exactly  20  days ;  by  so  doing  you  incur  as  great  a  probability  of  dying 
as  of  living.  Even  with  provisions  for  21  days  the  journey  is  evidently  dangerous.  But 
if  you  can  carry  with  you  provisions  for  23—25  days,  the  undertaking  may  be  reasonable. 
Your  life  must  be  at  stake  to  make  you  set  out  with  provisions  for  only  17  days  or  less. 

In  addition  to  the  uncertainty  provided  against  by  the  presumptive  law  of  error, 
the  prediction  may  be  vitiated  by  the  uncertainty  of  the  data  of  the  presumptive  law  itself. 
When  this  law  has  resulted  from  purely  theoretical  speculation,  it  is  always  impossible  to 
calculate  its  uncertainty.  It  may  be  quite  exact,  or  partially  or  absolutely  false,  we  are. 
left  to  choose  between  its  admission  and  its  rejection,  as  long  as  no  trial  of  the  prediction 
by  repeated  observations  has  given  us  a  corresponding  law  of  actual  errors,  by  which  it 
can  be  improved  on. 
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It'  the  law  of  presumptive  errors  has  been  computed  by  means  of  a  law  of  actual 
errors,  we  can,  according  to  (37),  employ  the  values  >l,,  yla,  ...  and  the  number  m  of 
;utii;il  observations  for  the  determination  of  ^r  (/*,).  In  this  case  the  complete  half-invari- 
ants of  a  predicted  single  observation  are  given  analogously  to  the  law  of  errors  of  the  sum 
of  two  bondless  observations  by 


Though  we  can  in  the  same  way  compute  the  uncertainties  of  /12 ,  >13 ,  and  ^4,  it 
is  far  more  difficult,  or  rather  impossible,  to  make  use  of  these  results  for  the  improvement 
of  general  predictions. 

Of  the  higher  half-invariants  we  can  very  seldom,  if  ever,  get  so  much  as  a  rough 
estimate  by  the  method  of  laws  of  actual  errors.  The  same  reasons  that  cause  this 
difficulty,  render  it  a  matter  of  less  importance  to  obtain  any  precise  determination. 
Therefore  the  general  rule  of  the  formation  of  good  laws  of  presumptive  errors  must  be: 

1.  In  determining  >11  and  >12,  rely  almost  entirely  upon  the  actual  observed  values. 

2.  As   to   the  half-invariants  with  high   indices,  say   from   >I6   upwards,  rely  as 
exclusively  upon  theoretical  considerations. 

3.  Employ  the  indications  obtainable  by  actual  observed  values  for  the  intermediate 
half-invariants  as  far  as  possible  when  you  have  the  choice  between  the  theories  in  (2). 

From  what  is  said  above  of  the  properties  of  the  typical  law  of  errors,  it  is  evident 
that  no  other  theory  can  fairly  rival  it  in  the  multiplicity  and  importance  of  its  applications. 
It  is  not  only  constantly  applied  when  >13,  >14,  and  X5  are  proved  to  be  very  small,  but  it 
is  used  almost  universally  as  long  as  the  deviations  are  not  very  conspicuous.  In  these 
cases  also  great  efforts  will  be  made  to  reduce  the  observations  to  the  typical  form  by 
modifying  the  methods  or  by  substituting  means  of  many  observed  values  instead  of  the 
non-typical  single  observations.  The  preference  for  the  typical  observations  is  intensified 
by  the  difficulty  of  establishing  an  entirely  correct  method  of  adjustment  (see  the  following 
chapters)  of  observations  which  are  not  typical. 

In  those  particular  cases  where  ^3  or  ^4  or  >15  cannot  be  regarded  as  small,  the 
theoretical  considerations  (proposition  2  above)  as  to  /( 6  and  the  higher  half-invariants  ought 
not  to  result  in  putting  the  latter  =  0.  As  shown  in"Videnskabernes  Selskabs  Oversigter", 
1899,  p.  140,  such  laws  of  errors  correspond  to  divergent  series  or  imply  the  existence  of 
imaginary  observations.  The  coefficients  kr  of  the  functional  law  of  errors  (equation  (6)) 
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have  this  merit  in  preference  to  the  half-invariants,  that  no   term  implies   the   existence 
of  any  other. 

This  series 


- 
where  <f>(x)  =  e      2^      (the  direct  expression  (31)  is  found  p.  35),  is  therefore  recommended 

as  perhaps  the  best  general  expression  for  non-typical  laws  of  errors.  The  functional  form 
of  the  law  of  errors  has  here,  and  in  every  prediction  of  future  results,  the  advantage  of 
showing  directly  the  probabilities  of  the  different  possible  values. 

The  skew  and  other  non-typical  laws  of  errors  seem  to  have  some  very  interesting 
applications  to  biological  observations,  especially  to  the  variations  of  species.  The  scientific 
treatment  of  such  variations  seems  altogether  to  require  a  methodical  use  of  the  notion  of 
laws  of  errors.  Mr.  K.  Pearson  has  given  a  series  of  skillful  computations  of  biological  and 
other  similar  laws  of  errors  (Contributions  to  the  Math.  Theory  of  Evolution,  Phil.  Trans. 
V.  186,  p.  343).  Here  he  makes  very  interesting  efforts  to  develop  the  refractory  binomial 
functions  into  a  basis  for  the  treatment  of  skew  laws  of  errors.  But  there  are  evidently 
no  natural  links  between  these  functions  and  the  biological  problems,  and  the  above  formulas 
(31)  will  prove  to  be  easier  and  more  powerful  instruments.  In  cases  of  very  abnormal 
or  discontinuous  laws  of  errors,  more  refined  methods  of  adjustment  are  required. 

Example  1.  From  the  500  experiments  given  in  §  14  are  to  be  calculated  the 
presumptive  half-invariants  up  to  ^5  and  by  (31)  the  frequencies  of  the  special  events  out 
of  a  number  of  s0  =  500  new  repetitions.  You  will  find  Xl  =  11-86,  ^2  ==4-1647,  ^3  = 
4-708,  ^4  «=*  3-895,  and  /15  ==  —  26-946.  A  comparison  of  the  computed  frequencies  with 
the  observed  ones  gives: 

Frequency 


Events 

computed 

observed 

0—  C 

4 

o-o 

0 

o-o 

5 

-0-1 

0 

+    0-1 

6 

-0-3 

0 

+    0-3 

7 

1-6 

3 

-f    1-4 

8 

12-3 

7 

5-3 

9 

39-6 

35 

4-6 

10 

78-2 

101 

+  22-8 

11 

104-1 

89 

-15-1 

12 

97-7 

94 

3-7 

13 

69-4 

70 

+    0-6 

14 

42-8 

46 

+    3-2 
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15 

n; 

17 
18 
19 
20 
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l-'n-'iuency 
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W-7 

30 

HH> 

15 

8-0 

4 

3-0 

5 

0-8 

1 

0-2 

0 

0-0 

0 

o— c 
-f    3-3 
1-0 
4-0 

+    2-0 

+   0-2 

-   0-2 

00 


Example  2.  Determine  the  law  of  errors  by  experiments  with  alternative  results, 
either  "yes"  observed  m  times  and  every  time  indicated  by  1,  or  "no"  observed  n  times 
and  indicated  by  0.  What  is  the  square  of  the  mean  error  for  the  single  experiment? 


X2  = 


inn 


(m  -(-  n)  (m  -\-  n  —  1)  ' 
for  the  probability  determined  by  the  whole  series? 


mn 


and  for  the  frequency  of  "yes"  in  the  m  -f  n  experiments? 

mn 

X2  (s.)  =  -          —=. 
m-\-n  —  \ 

§  34.  If  observations  are  made  and  repeated,  although  their  presumptive  mean 
value  is  previously  known,  exactly  or  very  accurately,  the  law  of  presumptive  errors  of 
the  half-invariants  ^2,  /^3  ...  must  be  computed  by  reducing  the  zero  of  the  observation 
to  the  known  x\.  Putting  thus  .Sj  =  0  and  fil  =  0  in  the  equations  (19)  and  (21)  we 
obtain  in  analogy  to  (38) — (41)  the  following  modified  equations,  the  number  of  repetitions 
being  =  m: 

Xj(^2)    =   fJL<t    =    Xo 
^l(^a)    ==    f*3     ==     ^3 

m  —  3 


I  (-J  =  ^4  = 


m 

m  — 


T,=i 
m    2 


w 


m 


X  q  X  <>  • 


(4S, 


From  the  first  of  these  equations  we  deduce  the  very  important  principle,  that 
every  mean  of  the  squares  of  differences  between  repeated  bond-free  observations  and  their 
presumptive  mean  value  is  approximately  equal  to  the  square  of  the  mean  error 

2(°-^  -  -  x2.  (49) 


7* 
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Consequently,  for  any  isolated  observed  value  we  must  expect  that 


(50) 


§  35.  In  the  following  chapters,  and  in  almost  all  practical  applications,  we  shall 
work  only  with  the  typical  law  of  errors  as  our  constant  supposition.  This  gives  simplicity 
and  clearness,  and  thus  a  ±  b  may  be  recommended  as  a  short  statement  of  the  law  of 
errors,  a  =  yl1  indicating  a  result  of  an  observation  found  directly  or  indirectly  by  compu- 
tation with  observations,  and  b  =  V^2  expressing  the  mean  error  of  the  same  result. 

By  the  "weights"  of  observations  we  understand  numbers  inversely  proportional  to 

T* 

the  squares  of  the  mean   errors,  consequently  v  =  -r.    The  idea  presents  itself  when  we 

X2 

speak  of  the  means  of  various  numbers  of  observed  values  which  have  been  obtained   by 

the  same   method,   as   the   latter  numbers  here,  according  to  (37),  represent  the  weights 

When  vr  is  the  weight  of  the  partial  mean  value  mr,   the  total  mean   value  m  must  be 
computed  according  to  the  formula 


which  is  analogous  to  the  formula  for  the  abscissa  of  the  centre  of  gravity,  if  mr  is  the 
abscissa  of  any  single  body,  vr  its  weight.  We  speak  also  of  the  weights  of  single  obser- 
vations, according  to  the  above  definition,  and  particularly  in  cases  where  we  can  only 
estimate  the  relative  goodness  of  several  observations  in  comparison  to  the  trustworthiness 
of  the  means  of  various  numbers  of  equally  good  observations. 

The  phrase  'probable  error,  which  we  still  find  frequently  employed  by  authors  and 
observers,  is  for  several  reasons  objectionable.  It  can  be  used  only  with  typical  or  at  any 
rate  symmetrical  laws  of  errors,  and  indicates  then  the  magnitude  of  errors  for  which  the 
probabilities  of  smaller  and  larger  errors  are  both  equal  to  \.  The  simultaneous  use  of 
the  ideas  "mean  error"  and  "probable  error"  causes  confusion,  and  it  is  evidently  the  latter 
that  must  be  abandoned,  as  it  is  less  commonly  applicable,  and  as  it  can  only  be  computed 
in  the  cases  of  the  typical  law  of  errors  by  the  previously  computed  mean  error  as 
0-6745  1//2  ,  while  on  the  other  hand  the  computation  of  the  mean  error  is  quite 
independent  of  that  of  the  probable  error.  As  errors  which  are  larger  than  the  probable 
one,  still  frequently  occur,  this  idea  is  not  so  well  adapted  as  the  mean  error  to  serve  as 
a  limit  between  the  frequent  "small"  errors  and  the  rarer  "large"  ones.  The  use  of  the 
probable  error  tempts  us  constantly  to  overvalue  the  degree  of  accuracy  we  have  attained. 

More  dangerous  still  is  another  confusion  which  now  and  then  occurs,  when  the 
very  expression  mean  error  is  used  in  the  sense  of  the  average  error  of  the  observed  values 
according  to  their  numerical  values  without  regard  to  the  signs.  This  gives  no  sense, 
except  when  we  are  certain  of  a  law  of  typical  errors,  and  with  such  a  one  this  ,,mean 
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error"  is  V  ^  •  '^ie  onty  reason  which  may  be  advanced  in  defence  of  the  use  of  this 
idea  is  that  we  are  spared  some  little  computations,  viz.  some  squarings  and  the  extraction 
of  a  square  root,  which,  however,  we  rarely  need  work  out  with  more  than  three  signi- 
ti.  ;inl,  figures. 

IX.     FREE  FUNCTIONS. 

§  36.  The  foregoing  propositions  concerning  the  laws  of  errors  of  functions  - 
especially  of  linear  functions  -  -  form  the  basis  of  the  theory  of  computation  with  observed 
values,  a  theory  which  in  several  important  things  differs  from  exact  mathematics.  The 
result,  particularly,  is  not  an  exact  quantity,  but  always  a  law  of  errors  which  can  he 
represented  by  its  mean  value  and  its  mean  error,  just  like  the  single  observation.  More- 
over, the  computation  must  be  founded  on  a  correct  apprehension  of  what  observations 
we  may  consider  mutually  unbound,  another  thing  which  is  quite  foreign  to  exact  mathe- 
matics. For  it  is  only  upon  the  supposition  that  the  result  R  =  rlol  -\-...rnon  =  \ro] 
-  observe  the  abbreviated  notation  -  -  is  a  linear  function  of  unbound  observations  only, 
ol  ...on,  that  we  have  demonstrated  the  rules  of  computation  (35) 

^(R)  =  rMoJ  +  . . .  +  rn^(on)  -•  [r^(o)]  (52) 

**(*)  -  r'MoJ  +  . . .  +  rW,(o.)  =  [r«^(o)].  (53) 

While  the  results  of  computations  with  observed  quantities,  taken  singly,  have  laws 
of  errors  in  the  same  way  as  the  observations,  they  also  resemble  the  observations  in  the 
circumstances  that  there  can  be  bonds  between  them,  and,  unfortunately,  there  can  be 
bonds  between  "results1',  even  though  they  are  derived  from  unbound  observations.  If 
only  some  observations  have  been  employed  in  the  computation  of  both  R'  =  \r'o\  and 
R"  =  [r"o],  these  results  will  generally  be  bound  to  each  other.  This,  however,  does  not 
prevent  us  from  computing  a  law  of  errors,  for  instance  for  aR  -\-  bR".  We  can,  at  any 
rate,  represent  the  function  of  the  results  directly  as  a  function  of  the  unbound  observations, 

aR'  +  bR"  =  [(ar'  +  ftr")o].  (54) 

This  possibility  is  of  some  importance  for  the  treatment  of  those  cases  in  which 
the  single  observations  are  bound.  They  must  be  treated  then  just  like  results,  and  we 
must  try  to  represent  them  as  functions  of  the  circumstances  which  they  have  in  common, 
and  which  must  be  given  instead  of  them  as  original  observations.  This  may  be  difficult 
to  do,  but  as  a  principle  it  must  be  possible,  and  functions  of  bound  observations  must 
therefore  always  have  laws  of  errors  as  well  as  others;  only,  in  general,  it  is  not  possible 
to  compute  these  laws  of  errors  correctly  simply  by  means  of  the  laws  of  errors  of  the 
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observations  only,  just  as  we  cannot,  in  general,  compute  the  law  of  errors  for  aR'  -f-  bR" 
by  means  of  the  laws  of  errors  for  R'  and  R". 

In  example  5,  §  29,  we  found  the  mean  error  in  the  determination  of  a  direction  R 

between  two  points,  which  were  given  by  bond-free  and  equally  good  (X%(x)  =  x2(y)  =  1) 

2 
measurements  of  their  rectangular  co-ordinates,  viz.:  ^2(J?)  =  -^,  and  then,  in  example  6, 

we  determined  the  angle  F  in  a  triangle  whose  points  were  determined  in  the  same  way. 
It   seems    an    obvious    conclusion   then   that,    as    V  =  R'  —  R"  ,    we   must   have   >J2(F) 

2  2  J2    I    J'2    I    J"2 

=  /1  2  (#')-)-/*  2  (#")  =  -j^  +-J772-    But  this  is  not  correct  ;  the  solution  is  /%(  F)  =    -^5-  jj~2  —  , 
where  J,  J',  and  J"  are  the  sides  of  the  triangle.    The  cause  of  this  is,   of  course,  that 
the  co-ordinates  of  the  angular  point  enter  into  both  directions  and  bind  R'  and  R"  together. 
But  it  is  remarkable  then  that,  when   F  is  a  right  angle,  the  solutions  are  identical. 
With  equally  good  unbound  observations,  o0,  on  o2,  and  o3,  we  get 
;2(o2  —  2orf  o0)  ===  6^2(o) 
^2(o3—  2o2+o,)  •==  6x\2(o), 
but 

x*2(03  —  302+30,  —  oc)  =  20/\2(o), 

although  o3—  302+30!  —  00  =  (o3—2ozjrol)  —  (o2  —  2o,-(-o0),  according  to  which  we 
should  expect  to  find 


But  if,  on  the  other  hand,  we  combine  the  two  functions 

R'  =  Oo-j_6ol  —  4o2    and    ^"  =  2ox  +  3o2  —  o3, 

where  /12C#')  =  53>i2(o)  and  Xz(R")  =  \^^(o},  and  from  this  compute  Xz  for  any  function 
aR'-{-bR",   then,    curiously    enough,    we   get    as    the    correct   result    Az(aR'-\-  bR")  = 


Gauss's  general  prohibition  against  regarding  results  of  computations  —  especially 
those  of  mean  errors  —  from  the  same  observations  as  analogous  to  unbound  observations, 
has  long  hampered  the  development  of  the  theory  of  observations. 

To  Oppermann  and,  somewhat  later,  to  Helmert  is  due  the  honour  of  having 
discovered  that  the  prohibition  is  not  absolute,  but  that  wide  exceptions  enable  us  to 
simplify  our  calculations.  We  must  therefore  study  thoroughly  the  conditions  on  which 
actually  existing  bonds  may  be  harmless. 

Let  o  M...on  be  mutually  unbound  observations  with  known  laws  of  errors,  ^  ,(<>,•), 
Az(0i),  of  typical  form.  Let  two  general,  linear  functions  of  them  be 

[P0]    =  pl0l  +... 

[qo]  =  qlol  -f  ... 


«    0   I 

-0/1 

For  a  general  function  of  these,  F=a[po]-\-b[qo],  the  correct  computation  of  the  law  of 
errors  by  means  of  F  =  \(ap-\-bq)o\  will  further  give 

-i(«.)  •         | 
/ 

.M  -•  I 

,        -rr .  J 

Ar(F)  ==  0  for  r>2. 

It  appears  then,  both  that  the  mean  values  can  be  computed  unconditionally,  as 
if  [po]  and  \qo\  were  unbound  observations,  and  that  the  law  of  errors  remains  typical. 
Only  in  the  square  of  the  mean  error  there  is  a  difference,  as  the  term  containing  the 
factor  2ab  in  L2(F)  ought  not  to  be  found  in  the  formula,  if  [po]  and  [qo]  were  not 
bound  to  one  another. 

When  consequently 

[pqlt(o)]  =  PiqMoJ  +  •  •  •  -\- pnqnL,(on)  =  0  (57) 

the  functions  [po]  and  [qo]  can  indeed  be  treated  in  all  respects  like  unbound  observations, 
for  the  law  of  errors  for  every  linear  function  of  them  is  found  correctly  determined  also 
upon  this  supposition.  We  call  such  functions  mutually  "free  functions",  and  for  such, 
consequently,  the  formula  for  the  mean  error 

*i([po]a  +  \_qo\b)  ==  a2O2^,(o)]  +  fl«b«Jlt(0)]  (58) 

holds  good. 

If  this  formula  holds  good  for  one  set  of  finite  values  of  a  and  6,  it  holds  good 
for  all. 

If  two  functions  are  mutually  free,  each  of  them  is  said  to  be  "free  of  the  other", 
and  inversely. 

Example  1.  The  sum  and  difference  of  two  equally  good,  unbound  observations 
are  mutually  free. 

Example  2.  When  the  co-ordinates  of  a  point  are  observed  with  equal  accuracy 
and  without  any  bonds,  any  transformed  rectangular  co-ordinates  for  the  same  will  be 
mutually  free. 

Example  3.  The  sum  or  the  mean  value  of  equally  good,  unbound  observations 
is  free  of  every  difference  between  two  of  these,  and  generally  also  free  of  every  (linear) 
function  of  such  differences. 


Example  4.  The  differences  between  one  observation  and  two  other  arbitrary,  un- 
bound observations  cannot  be  mutually  free. 

Example  5.  Linear  functions  of  unbound  observations,  which  are  all  different,  are. 
always  free. 

Example  6.     Functions  with  a  constant  proportion  cannot  be  mutually  free. 

§  37.  In  accordance  with  what  we  have  now  seen  of  free  functions,  corresponding 
propositions  must  hold  good  also  of  observations  which  are  influenced  by  the  same  circum- 
stances: it  is  not  necessary  to  respect  all  connecting  bonds;  it  is  possible  that  actually 
bound  observations  may  be  regarded  as  free.  The  conditions  on  which  this  may  be  the 
case,  must  be  sought,  as  in  (57),  by  means  of  the  mean  errors  caused  by  each  circumstance 
and  the  coefficients  by  which  the  circumstance  influences  the  several  observations.  —  Note 
particularly : 

Jf  two  observations  are  supposed  to  be  connected  by  one  single  circumstance  which 
they  have  in  common,  such  a  bond  must  not  be  left  out  of  consideration,  but  is  to  be 
respected.  Likewise,  if  there  are  several  bonds,  each  of  which  influences  both  observations 
in  the  same  direction. 

If,  on  the  other  hand,  some  common  circumstances  influence  the  observations  in  the 
same  direction,  others  in  opposite  directions,  and  if,  moreover,  one  class  must  be  supposed 
to  work  as  forcibly  as  the  other,  the  observations  may  possibly  be  free,  and  the  danger  of 
treating  them  as  unbound  is  at  any  rate  less  than  in  the  other  cases. 

§  38.  Assuming  that  the  functions  of  which  we  shall  speak  in  the  folloving  are 
linear,  or  at  any  rate  may  be  regarded  as  linear  when  expanded  by  Taylor's  formula, 
because  the  errors  are  so  small  that  we  may  reject  squares  and  products  of  the  deviations 
of  the  observations  from  fixed  values;  and  assuming  that  the  observations  on  . . .  on,  on 
which  all  the  functions  depend,  are  unbound,  and  that  the  values  of  ^(oj  ...  A.,(on)  are 
given,  we  can  now  demonstrate  a  series  of  important  propositions. 

Out  of  the  total  system  of  all  functions 

[po]  ==  plol  +  ...  +  pnon 

of  the  given  n  observations  we  can  arbitrarily  select  partial  systems  of  functions,  each 
partial  system  containing  all  those,  which  can  be  represented  as  functions  of  a  number  of 
m<_n  mutually  independent  functions,  representative  of  the  system, 

[ao]   =  alol  -L  . . .  -f-  anon 


[do]  =  dlol  -f  .  . .  -\-dno»  , 

of  which  no  one  can  be  expressed  as  a  function  of  the  others.    We  can  then  demonstrate 
the  existence  of  other  functions  which  are  free  of  every  function  belonging  to  the  partial 
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system  represented  by  [ao]  ----  \do\.  It  is  sufficient  to  prove  that  such  a  function 
[9°\  •—  9i°i  +  •  •  '-}~9*0»  ^  free  of  [ao]...  [do]  in  consequence  of  the  equations 
[#ayU|  =  0  ...  [yd  A9]  =  0.  For  if  so,  \go\  must  be  free  of  every  function  of  the  partial 
system, 

[(xa  -\-  ...-\-zd)o\  =  x[ao]  -j-  .  .  .  +  *[rfo"|. 
because 

=  0. 


Any  function  of  the  total  system  [po]  can  now  in  one  single  way  be  resolved 
into  a  sum  of  two  functions  of  the  same  observations,  one  of  which  is  free  of  the  partial 
system  represented  by  [ao] . . .  [do],  while  the  other  belongs  to  this  system. 

If  we  call  the  free  addendum  [p'o],  this  proposition  may  be  written 

[po]  =  =  [p'o]  +  [x  [ao]  +  . . .  +  z  [do]}.  (59) 

By  means  of  the  conditions  of  freedom,  [p'a^z]  =  .. .  =  [p'dA2]  =  0,  all  that 
concerns  the  unknown  function  [p'o]  can  be  eliminated.  We  find 

-f  •  •  •  -\- z[da 

(60) 


from  which  we  determine  the  coefficients  x  ...  z  unambiguously.  The  number  m  of  these 
equations  is  equal  to  the  number  of  the  unknown  quantities,  and  they  must  be  sufficient 
for  the  determination  of  the  latter,  because,  according  to  a  well  known  proposition  from 
the  theory  of  determinants,  the  determinant  of  the  coefficients 

ar, . . .  a, 


lr,  •  •  •  d. 

is  positive,  being  a  sum  of  squares,  and  cannot  be  =  0,  unless  at  least  one  of  the  func- 
tions [ao]  . . .  [do]  could,  contrary  to  our  supposition ,  be  represented  as  a  function  of 
the  others. 

From  the  values  of  x ...  z  thus  found,  we  find  likewise 

[p'o]  =  [po]  —  x[ao]  —  ...  —  z[do\.  (61) 


If  [po]  belongs  to  the  partial  system  represented  by  [ao]  ....  [do] ,  the  de- 
termination of  x . . . .  z  expresses  its  coefficients  in  that  system  only,  and  then  we  get 
identically  [p'o]  =  0. 


58 

But  if  we  take  [po]  out  of  the  partial  system,  then  (61)  gives  us  [p'o]  as  different 
from  zero  and  free  of  that  partial  system.  If  [po]  —  [go]  belongs  to  the  partial  system  of 
[ao]...[do],  [ao]  must  produce  in  this  manner  the  very  same  free  function  as  [po]. 


Let  [po]  . . .  [ro~\  be  n  —  m  functions,  independent  of  one  another  and  of  the  m 
functions  [ao]  . . .  [do] ;  if  we  then  find  [p'o]  out  of  [po]  and  [r'o]  out  of  [ro]  as  the  free 
functional  parts  in  respect  to  [ao] . . .  [do],  the  n  functions  [ao] . . .  [do]  and  [p'o]  . . .  [r'o] 
may  be  the  representative  functions  of  the  total  system  of  the  functions  of  ol...oni  because 
no  relation  a  [p'o]  +  ...  -|-  3  [r'o]  =  0  is  possible;  for  by  (61)  it  might  result  in  a  relation 
a  [po]  -f- . . .  -f-  d[ro]  -j-  K  [ao]  -f- . . .  -j-  <p  [do]  =  0  in  contradiction  to  the  presumed  represen- 
tative character  of  [po]  . . .  [ro]  and  [ao]  . . .  [do]. 

If  we  employ  [p'o]  . . .  [r'o]  or  other  n  —  m  mutually  independent  functions 

[go]...[ko], 

all  free  of  the  partial  set  [ao] . . .  [do],  as  representative  functions  of  another  partial  system 
of  <>!... o«,  then  every  function  of  this  system  must  be  free  of  every  function  of  the  partial 
system  [ao] . . .  [do]  (Compare  the  introduction  to  this .  §).  No  other  function  of  o^-.o,, 
can  be  free  of  [ao] . . .  [do]  than  those  belonging  to  the  system  [go] . . .  [ko] ;  otherwise  we 
should  have  more  than  n  independent  functions  of  the  n  variables  o ,  . . .  on. 


Thus  selecting  arbitrarily  a  partial  system  of  functions  of  the  observations  ol  ...on 
we  can  -  with  reference  to  given  squares  of  mean  errors  ^(oj  . . . /}2(on)  -  distribute 
the  linear  homogeneous  functions  of  these  observations  into  three  divisions: 

1)  the  given  partial  system  [ao] . . .  [do], 

2)  the  partial  system  of  functions  [go] . . .  [ko],  which  are  free  of  the  former,  and 

3)  all  the  rest,  of  which  it  is  proved  that  every  such  function  is  always  in  only  one  way 
compounded  by  addition  of  one  function  of  the  first  partial  system  to  one  of  the  second. 


The  freedom  of  functions  is  a  reciprocal  property.  If  the  second  partial  system 
\go\ . . .  [ko]  were  selected  arbitrarily  instead  of  the  first  [ao] . . .  [do],  then  only  this  latter 
would  be  found  as  the  free  functions  in  2);  the  composition  of  every  function  in  3)  would 
remain  the  same. 

Example.  Determine  the  parts  of  o1-]-o2,  o3  -fo4,  o1-|-o4,  and  o2  +o3,  which 
are  free  of  Oj4-o3  and  o2  +  o4 ,  on  the  supposition  that  all  4  observations  are  equally 
exact  and  unbound. 

Answer:    |  (Oj  4-  o2  —  o3  — -  o4),  etc. 
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$  HI).  Liko  all  other  functions  of  the  observations  o,  .  ..o»,  each  of  these  observed 
\  alms,  for  instance  o,,  is  the  sum  of  two  quantities,  one  o"  belonging  to  the  system  of 
\no\  .  .  .  |(/o],  the  other  o\  to  the  partial  system  of  \go\  .  .  .  \ko\,  which  is  free  of  this.  But 
from  Qi  ,  —  o"  +  ol  follows,  generally,  that  |  po  \  =.  \  po"]  -f  [po1  |,  and  [po"\  evidently  belongs 
to  the  system  of  \ao]  .  .  .  [do],  [po']  to  the  system  which  is  free  of  this.  Accordingly  there  must 
between  the  n  functions  o\  .  .  .  o'n  exist  m  relations  [ao']  =  .  .  .  [do']  =  0  ;  likewise  n  —  m 
relations  [go"~\  =  .  .  .  [ko"]  =  0,  between  o"}  .  .  .  o»  . 

§  40.  That  the  functions  of  observations  can  be  split  up,  in  an  analogous  way, 
into  three  or  more  free  quantities,  is  of  no  consequence  for  the  following,  except  when  we 
imagine  this  operation  to  be  carried  through  to  the  utmost.  It  is  easy  enough  to  see, 
however,  that  also  the  partial  systems  of  functions  can  be  split  up.  We  could,  for  instance, 
among  the  representatives  [ao]  .  .  .  [do]  of  one  partial  system  select  a  smaller  number 
[ao]  .  .  .  [bo],  and  from  the  others  [co]  .  .  .  [do],  according  to  (37),  separate  the  functions 
[c'o]  .  .  .  [d'o]  which  were  free  of  [ao]  .  .  .  [bo],  [c'o]  .  .  .  [d'o]  would  then  represent  the  sub- 
system of  functions,  free  of  [ao]  .  .  .  [bo],  within  the  partial  system  [ao]  .  .  .  [do]  ;  and  in 
this  way  we  may  continue  till  all  representative  functions  are  mutually  free,  every  single 
one  of  all  the  rest.  Such  a  collection  of  representative  functions  we  call  a  complete  set 
of  free  functions.  Their  number  is  sufficient  to  enable  us  to  express  all  the  observations, 
and  all  functions  of  these  observations,  as  functions  of  them;  and  their  mutual  freedom 
has  the  effect  that  they  can  be  treated,  by  all  computations  of  laws  of  errors,  quite  like 
unbound  observations,  and  thus  wholly  replace  the  original  observations. 

§41.  The  mathematical  theory  of  the  transformations  of  observations  into  free  func- 
tions is  analogous  to  the  theory  of  the  transformation  of  rectangular  co-ordinates  (comp. 
§  36,  example  2),  and  is  treated  in  several  text-books  of  the  higher  algebra  and  determinants 
under  the  name  of  the  theory  of  the  orthogonal  substitutions.  I  shall  here  enter  into 
those  propositions  only,  which  we  are  to  use  in  what  follows. 

When  we  have  transformed  the  unbound  observations  o^-.o,,  into  the  complete 
set  of  free  functions  [ao],  [b'o]...[d"o],  it  is  often  important  to  be  able  to  undertake  the 
opposite  transformation  back  to  the  observations.  This  is  very  easily  done,  for  we  have 

(  /JV  \ 

M  +  •  •  •  +  --  -  -  Wo]}  /L(o,)  ,  (62) 

L 


which  is  demonstrated  by  substitution  in  the  equations  for  the  direct  transformation 

[ao]  =  fl^o,-}-  ...  +  anon 


[dvo]  =  <o, -f...  +<&>», 
because    [«6'/l2]  =  [«rf"^2]  =  . . .  [b'dvlti  \  =  0. 
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As  the  original  observations,  considered  as  functions  of  the  transformed  observations 
[ao] . . .  \dvo],  must  be  mutually  free,  just  as  well  as  the  latter  are  free  functions  of  the 
former,  we  find  by  computing  the  squares  of  the  mean  errors  >12  (o,)  and  the  equation  that 
expresses  the  formal  condition  that  o,-  is  free  of  o*,  two  of  the  most  remarkable  properties 
of  the  orthogonal  substitutions: 

1      _      «'?      !  ...  |       <%*  (63) 

and  „  v 


If  all  observations  and  functions  are  stated  with  their  respective  mean  error  as 
unity,  or  are  divided  by  their  mean  error,  a  reduction  which  gives  also  a  more  elegant 
form  to  all  the  preceding  equations,  the  sum  of  the  squares  of  the  thus  reduced  observations 
is  not  changed  by  any  (orthogonal)  transformation  into  a  complete  set  of  free  functions. 

We  have 

)     r^^n     I     '  "  *    t     ' .-  •,,, ^          "I    77 \ i     •  •  •  ~T 


(°») 

which,  pursuant  to  the  equations  (63)  and  (64),  is  easily  demonstrated  by  working  out  the 
sums  of  the  squares  in  the  numerators  on  the  left  side  of  the  equation.  As  this  equation 
is  identical,  the  same  proposition  holds  good  also,  for  instance,  of  the  differences  between 
ol...on  and  n  arbitrarily  selected  variables  corresponding  to  them  vl...vn^  and  of  the 
corresponding  differences  between  the  values  of  the  functions.  Also  here  is 
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^  +  ...  +  ^ 


§  42.  For  the  practical  computation  of  a  complete  set  of  free  functions  it  will  be 
the  easiest  way  to  bring  forward  the  functions  of  such  a  set  one  by  one.  In  this  case  we 
must  select  a  sufficient  number  of  functions  and  fix  the  order  in  which  these  are  to  be 
taken  into  consideration.  For  a  moment  we  can  imagine  this  order  to  be  arbitrary. 

The  function  [ao],  which  is  the  first  in  this  list,  is  now,  unchanged,  taken  into  the 

transformed  set.     By  multiplying  the  selected  function  by  suitable   constants  of  the  form 

-pJ ,  and  subtracting  the  products  from  the  remaining  functions  [bo]  in  the  list,  we  can, 

[Ct  tt  A g  J 

according  to  §  38,  from  each  of  these  separate  the  addendum  which  is  free  of  the  selected 
function.  Of  these  then  the  one  which  is  founded  on  function  Nr.  2  on  the  list  is  taken 
into  the  transformed  set.  This  function  is  multiplied  in  the  same  way  and  subtracted  from 
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the  still  remaining  functions,  so  that  they  give  up  the  addenda  which  are  free  of  both  the 
selected  functions,  and  so  on.  The  following  schedule  shows  the  course  of  the  operation 
for  the  case  n  =  4. 


l-'lllll- 
tions 


Coefficients 


Sums  of  the  Products 


[    ""I  a\ 

[    bo]  bt 

[  co]  c, 

[  do]  d, 

[b'o]  b\ 

[c'o]  c\ 

[d'o]  d\ 


[d'"o] 


<   d'l   d"3 
d''  d™  d™ 


Rale  of  Computatidii 


a i  [aaX]  [abA]  [  act]  [  adX]  [  ao]  is  selected. 

bt  [baX\  [bbt]  [  bet]  [   bdX]  [  bo]-[ao].[ 

c4  [cat]  [cbt]  [  cct]  [    cdX]  [  co]-[ao]-[   cat] :  [  aat]  =  =  [   c'o} 

d<  [dat]  [dbX]  [  dcX]  [   ddl]  [  do]  -  [ao]  •  [  daX\ :  [  aaX\  =  [  d'o] 

b\  \b'b'X\  \b'c' X]  [  b'd' X\  [  b'o]  is  selected,  is  free  of  [ao] 

c\  [c'b'l]  [c'c'X]  [  c'd'X]  [c'o]  —  [b'o]  -  [  c'b't] :  [  b'b'X]  ==  [  c"o] 

d\  [d'b'X\  [d'c'  X]  [  d'd'  X]  [d'o]  —  [  b'o]  •  [  d'b'X] :  [  b'b'X]  =  [  d"o] 


I]      [c"o]  is  selected,  is  free  of  [b'o]  and  [a'o] 
[d"d"  X]      [  d"  o]  -  [c"o] .  [d"c"X] :  [c"c"t]  =  [d'"o] 

[d'"d'"X]      [d"'o]  is  free  of  [c"o],  [b'o],  and  [ao]. 


The  computations  of  the  sums  of  the  products  (in  which  for  the  sake  of  brevity 
we  have  written  A  for  Az  (o))  could  be  made  all  through  by  means  of  the  single  coef- 
ficients in  the  transformed  functions,  as  it  must  be  done  in  the  beginning  by  means 
of  the  coefficients  in  the  original  functions.  It  is  much  easier,  however,  (particularly  if 
for  some  reason  or  other  we  might  otherwise  do  without  the  computation  of  the  coeffi- 
cients of  the  transformed  functions),  to  make  use,  for  this  purpose,  of  the  following  remark- 
able property  of  these  sums  of  the  products.  We  have,  for  instance, 


™  m        T/A        [ba 
[b'c'Z]  =  \{b— a-t — 

[V         [<M 


rj,  n       r     n  [bat]        ,        [ca^]       r     ...  [bat]  [cat] 

=  [bcl]  —  [act]  ± — £  —  [bat]  ± — ^  -\  -  [aa^]  ^ — ^  -^ — ± 

J  [aaX]  J  [aaX]  J  [aat]  [aaX\ 

=  [beX\  —  [act]  -  [bat] :  [aa>l]  . 


(67) 


Consequently,  the  same  general  rule  of  computation  as,  according  to  the  schedule,  holds 
good  of  the  functions  and  their  coefficients,  holds  good  also  of  the  sums  of  the  products 
and  of  the  squares.  The  schedule  gets  the  following  appendix: 
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[bbX]  -[abX\  -  [baX]  :  [aaX]  =  [b'b'X],  [bet]  -[acX]  •  [baX]  :  [aaX\  =  [6Vx],  [bdX\-[adX\  -  [baX]  :  [aaX]  =  [b'd'X] 
[cbX]  -[abX\-[caX\  :  [aaX]  =  [MX],  [ccX]  —  [acX]-[caX\:[aaX\  =  [cVx],  [cdX~\-[adX]-[caX\  :  [aaX]  =  [c'd'X] 
[dbX]-~[abX]'[daX]:[aaX]  =  [d'b'Xl  \dcX\-[act\>[daX\:\aaX\  =  [d'c'X],  [ddX\-[adX]-[daX]:[aaX]  =[d'd'X\ 

[c'&'x]  :  [b'b'X]  =  [cVx] 
=  \d"d"X\ 


[c'c'X]-[b'c'X]  •  [c'b'X]  :  [b'b'X]  =  [cV'x], 


[d"d"X]-[c"d"X] 


:  [cV'x]  — 


As  will  be  seen,  there  is  a  check  by  means  of  double  computation  for  each  of 
the  sums  of  the-  products  properly  so  called.  The  sums  of  the  squares  are  of  special 
importance  as  they  are  the  squares  of  the  mean  errors  of  the  transformed  functions, 
X2[ao]  =  [aax],  X2[6'o]  =  [b'b'X},  X2[c"o]  =  [cV'x],  and  Xz[d"'o]  =  [d'"d'"X], 

Example.  Five  equally  good,  unbound  observations  on  o2,  o3,  o4,and  o5  represent 
values  of  a  table  with  equidistant  arguments.  The  function  tabulated  is  known  to  be  an 
integral  algebraic  one,  not  exceeding  the  3rd  degree.  The  transformation  into  free  functions 
is  to  be  carried  out,  in  such  a  way  that  the  higher  differences  are  selected  before  the  lower 
ones.  (Because  J4,  certainly,  J3  etc.,  possibly,  represent  equations  of  condition).  With  symbols 
for  the  differences,  and  with  ^2(0j)  =  1,  we  have  then: 


on 

Coefficients 

Sums  of 

the  Products 

Factors 

03 

Oot 

+00 

2+103 

+0o4 

+0o5 

1 

-1 

-2 

3       6 

3 
35 

T/    >\s\ 

0 

0 

-1 

1 

0 

-1 

2 

3 

-6  -10 

] 

.if) 

0 

1 

-2 

1 

0 

-2 

3 

.    6 

-10  -20 

I 

FJ303 

0 

-1 

3 

-3 

1 

3 

-6 

-10 

20      35 

.1 

J403 

1 

-4 

6 

-4 

1 

6 

-10 

-20 

35      70 

is  selected 

»•-*<** 

3 
35 

1  2 
35 

17 
35 

12 
35 

3 
35 

1  7 
35 

i 

-? 

0 

0 

Fj03-[-i  J403 

7 

4 

1 

3 

7 

7 

-| 

4 

7 

7 

-1 

1 

J203-f-|J403 

2 

7 

r* 

7 

1 

7 

2 
7 

2 

i 

2 

7 

0 

0 

FJ3o3—  iJ403 

i 

1 

0 

-1 

1 

0 

-1 

0 

5 
2 

is  selected 

'3-^3 

3 
35 

12 
35 

17 
35 

12 
35 

3 
35 

1  7 
35 

1 
7 

2 

1 

rJ3o3--^J4o3 

2 
35 

6 
35 

1 

1 
35 

12 
35 

7 

6 
3  5 

7 

i 

2 

J203+fJ403 

2 

7 

1 

7 

7 

1 

2 
7 

-7 

1 

7 

2 

7 

is  selected 

f  J2o3+iJ4o3 

1 

| 

1 

1 

"5 

1 

0 

are  free 

rfo.-i*,. 

1 
5 

1 
10 

0 

1 
10 

* 

0 

•  1 

10 

are  both  selected. 

M 

The   complete   set   of   free   observations    and    the   squares   of  their    mean    errors 
are  thus: 


(0)  =  o3-f  J*0 

(1)  -  •  Fjo3-.}J20 

(5)  -  J2o3  f  f  J 

(3)=  FJ»o8-$J«o 

(4)=  J403  .    0l_4o.H-6o3-4o4  +  o6,        ;,(4)  ==  70 


Through  this  and  the  preceding  chapter  we  have  got  a  basis  which  will  generally 
be  sufficient  for  computations  with  observations  and,  in  a  wider  sense,  for  computations 
with  numerical  values  which  are  not  given  in  exact  form,  but  only  by  their  laws  of  errors. 
We  can,  in  the  first  place,  compute  the  law  of  errors  for  a  given,  linear  function  of  reci- 
procally free  observations  whose  laws  of  presumptive  errors  we  know.  By  this  we  can 
solve  all  problems  in  which  there  is  not  given  a  greater  number  of  observations,  and  other 
more  or  less  exact  data,  than  of  the  reciprocally  independent  unknown  values  of  the 
problem.  When  we,  in  such  cases,  by  the  means  of  the  exact  mathematics,  have  expressed 
each  of  the  unknown  numbers  as  a  function  of  the  given  observations,  and  when  we  have 
succeeded  in  bringing  these  functions  into  a  linear  form,  then  we  can,  by  (35),  compute 
the  laws  of  errors  for  each  of  the  unknown  numbers. 

Such  a  solution  of  a  problem  may  be  looked  upon  as  a  transformation,  by  which 
n  observed  or  in  other  ways  given  values  are  transformed  into  n  functions,  each  corre- 
sponding to  its  particular  value  among  the  independent,  unknown  values  of  the  problem. 
It  lies  often  near  thus  to  look  upon  the  solution  of  a  problem  as  a  transformation,  when 
the  solution  of  the  problem  is  not  the  end  but  only  the  means  of  determining  other  un- 
known quantities,  perhaps  many  other,  which  are  all  explicit  functions  of  the  independent 
unknowns  of  the  problem.  Thus,  for  instance,  we  compute  the  6  elements  of  the  orbit  of 
a  planet  by  the  rectascensions  and  declinations  corresponding  to  3  times,  not  precisely  as 
our  end,  but  in  order  thereby  to  be  able  to  compute  ephemerides  of  the  future  places  of 
the  planet.  But  while  the  validity  of  this  view  is  absolute  in  exact  mathematics,  it 
is  only  limited  when  we  want  to  determine  the  presumptive  laws  of  errors  of  sought 
functions  by  the  given  laws  of  errors  for  the  observations.  Only  the  mean  values,  sought 
as  well  as  given,  can  be  treated  just  as  exact  quantities,  and  with  these  the  general  linear 
transformation  of  n  given  into  n  sought  numbers,  with  altogether  w2  arbitrary  constants, 
remains  valid,  as  also  the  employment  of  the  found  mean  numbers  as  independent  variables 
in  the  mean  value  of  the  explicit  functions. 

If  we  want  also  correctly  to  determine  the  mean  errors,  we  may  employ  no  other 
transformation  than  that  into  free  functions.  And  if,  to  some  extent,  we  may  choose  the 
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independent  unknowns  of  the  problem  as  we  please,  we  may  often  succeed  in  carrying 
through  the  treatment  of  a  problem  by  transformation  into  free  functions ;  for  an  unknown 
number  may  be  chosen  quite  arbitrarily  in  all  its  n  coefficients,  and  each  of  the  following 
unknowns  looses,  as  a  function  of  the  observations,  only  an  arbitrary  coefficient  in  com- 
parison to  the  preceding  one;  even  the  n^  unknown  can  still  get  an  arbitrary  factor. 
Altogether  are  \n(n-\-\)  of  the  n"-  coefficients  of  these  transformations  arbitrary. 

But  if  the  problem  does  not  admit  of  any  solution  through  a  transformation  into 
free  functions,  the  mean  errors  for  the  several  unknowns,  no  matter  how  many  there 
may  be,  can  be  computed  only  in  such  a  way  that  each  of  the  sought  numbers  are  directly 
expressed  as  a  linear  function  of  the  observations.  The  same  holds  good  also  when  the 
laws  of  errors  of  the  observations  are  not  typical,  and  we  are  to  examine  how  it  is  with 
A3  and  the  higher  half-invariants  in  the  laws  of  errors  of  the  sought  functions. 

Still  greater  importance,  nay  a  privileged  position  as  the  only  legitimate  proceeding, 
gets  the  transformation  into  a  complete  set  of  free  functions  in  the  over-determined  problems, 
which  are  rejected  as  self-contradictory  in  exact  mathematics.  When  we  have  a  collection 
of  observations  whose  number  is  greater  than  the  number  of  the  independent  unknowns 
of  the  problem,  then  the  question  will  be  to  determine  laws  of  actual  errors  from  the 
standpoint  of  the  observations.  We  must  mediate  between  the  observations  that  contradict 
one  another,  in  order  to  determine  their  mean  numbers,  and  the  discrepancies  themselves 
must  be  employed  to  determine  their  mean  deviations,  etc.  But  as  we  have  not  to  do  with 
repetitions,  the  discrepancies  conceal  themselves  behind  the  changes  of  the  circumstances 
and  require  transformations  for  their  detection.  All  the  functions  of  the  observations 
which,  as  the  problem  is  over-determined,  have  theoretically  necessary  values ,  as ,  for 
instance,  the  sum  of  the  angles  of  a  plane  triangle,  must  be  selected  for  special  use. 
Besides,  those  of  the  unknowns  of  the  problem,  to  the  determination  of  which  the  theory 
does  not  contribute,  must  come  forth  by  the  transformation  by  which  the  problem  is  to 
be  solved. 

As  we  shall  see  in  the  following  chapters  on  Adjustment,  it  becomes  of  essential 
moment  here  that  we  transform  into  a  system  of  free  functions.  The  transformation  begins 
with  mutually  free  observations,  and  must  not  itself  introduce  any  bond,  because  the  trans- 
formed functions  in  various  ways  must  come  forth  as  observations  which  determine  laws 
of  actual  errors. 

X.     ADJUSTMENT. 

§  43.  Pursuing  the  plan  indicated  in  §  5  we  now  proceed  to  treat  the  determina- 
tion of  laws  of  errors  in  some  of  the  cases  of  observations  made  under  varying  or  different 
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essential  circumstances.  But  here  we  must  be  content  with  very  small  results.  The 
general  problem  will  hardly  ever  be  solved.  The  necessary  equations  must  be  taken  from 
th.  lotality  of  the  hypotheses  or  theories  which  express  all  the  terms  of  each  law  of  error 
-  say  their  half-invariants  --  as  functions  of  the  varying  or  wholly  different  circumstances 
of  the  observations.  Without  great  regret,  however,  the  multiplicity  of  these  theoretical 
f'lioitions  can  be  reduced  considerably,  if  we  suppose  all  the  laws  of  errors  to  be  exclusively 
of  the  typical  form. 

For  each  observation  we  need  then  only  two  theoretical  equations,  one  representing 
its  presumptive  mean  value  /^(o,),  the  other  the  square  of  its  mean  error  >12  (o,),  as  func- 
tions of  the  essential  circumstances.  But  the  theoretical  equations  will  generally  contain 
other  unknown  quantities,  the  arbitrary  constants  of  the  theory,  and  these  must  be  elimi- 
nated or  determined  together  with  the  laws  of  errors.  The  complexity  is  still  great  enough 
to  require  a  further  reduction. 

We  must,  preliminarily  at  all  events,  suppose  the  mean  errors  to  be  given  directly 
by  theory,  or  at  least  their  mutual  ratios,  the  weights.  If  not,  the  problems  require  a 
solution  by  the  indirect  proceeding.  Hypothetical  assumptions  concerning  the  X.t(0i)  are 
used  in  the  first  approximation  and  checked  and  corrected  by  special  operations  which,  as 
far  as  possible,  we  shall  try  to  expose  beside  the  several  solutions,  using  for  brevity  the 
word  "criticism"  for  these  and  other  operations  connected  with  them. 

But  even  if  we  confine  our  theoretical  equations  to  the  presumptive  means  ^(0.) 
and  the  arbitrary  unknown  quantities  of  the  theory,  the  solutions  will  only  be  possible  if 
we  further  suppose  the  theoretical  equations  to  be  linear  or  reducible  to  this  form. 
Moreover,  it  will  generally  be  necessary  to  regard  as  exactly  given  many  quantities  really 
found  by  observation,  on  the  supposition  only  that  the  corresponding  mean  errors  will  be 
small  enough  to  render  such  irregularity  inoffensive. 

In  the  solution  of  such  problems  we  must  rely  on  the  found  propositions  about 
functions  of  observations  with  exactly  given  coefficients.  In  the  theoretical  equations  of 
each  problem  sets  of  such  functions  will  present  themselves,  some  functions  appearing  as 
given,  others  as  required.  The  observations,  as  independent  variables  of  these  functions, 
are,  now  the  given  observed  values  o,,  now  the  presumptive  means  ^(o,);  the  latter  are, 
for  instance,  among  the  unknown  quantities  required  for  the  exact  satisfaction  of  the 
theoretical  equations. 

What  is  said  here  provisionally  about  the  problems  that  will  be  treated  in  the 
following,  can  be  illustrated  by  the  simplest  case  (discussed  above)  of  n  repetitions  of  the 
same  observation,  resulting  in  the  observed  values  on  ...  on.  If  we  here  write  the  theo- 
retical equations  without  introducing  any  unnecessary  unknown  quantities,  they  will  show 
the  forms  0  =  ^(o.)  —  ^(o*)  or,  generally,  0  ==  /(l  [a(o,—  o*)].  But  these  equations  are 
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evidently  not  sufficient  for  the  determination  of  any  ^(o,),  which  they  only  give  if  another 
/^(oj.)  is  found  beforehand.  The  sought  common  mean  cannot  be  formed  by  the  introduc- 
tion of  the  observed  values  into  any  function  [a(o,-  —  o*)],  these  erroneous  values  of  the 
functions  being  useful  only  to  check  ^  (o,-)  by  our  criticism.  But  we  must  remember  what 
we  know  about  free  functions:  that  the  whole  system  of  these  functions  [a(o,-  —  ok}\  is  only 
a  partial  system,  with  n  —  1  differences  Oi  —  ok  as  representatives.  The  only  wth  functions 
which  can  be  free  of  this  partial  system,  must  evidently  be  proportional  to  the  sum 
o ,  -f- . . .  -j-  on ,  and  by  this  we  find  the  sought  determination  by 

^l(°«)    =    —  (°1-+"  -  +  0n), 

IV  s 

the  presumptive  mean  being  equal  to  the  actual  mean  of  the  observed  values. 

If  we  thus  consider  a  general  series  of  unbound  observations,  o^  ...  on,  it  is  of 
the  greatest  importance  to  notice  first  that  two  sorts  of  special  cases  may  occur,  in  which 
our  problem  may  be  solved  immediately.  It  may  be  that  the  theoretical  equations  concern- 
ing the  observations  leave  some  of  the  observations,  for  instance  o1,  quite  untouched;  it 
may  be  also  that  the  theory  fully  determines  certain  others  of  the  observations,  for 
instance  on. 

In  the  former  case,  that  is  when  none  of  all  the  theories  in  any  way  concern  the 
observation  on  it  is  evident  that  the  observed  value  o^  must  be  approved  unconditionally, 
liven  though  this  observation  does  not  represent  any  mean  value  found  by  repetitions,  but 
stands  quite  isolated,  it  must  be  accepted  as  the  mean  A,^)  in  its  law  of  presumptive 
errors,  and  the  corresponding  square  of  the  mean  error  ^(oj  must  then  be  taken, 
unchanged,  from  the  assumed  investigations  of  the  method  of  observation. 

If,  in  the  latter  case,  on  is  an  observation  which  directly  concerns  a  quantity  that 
can  be  determined  theoretically  (for  instance  the  sum  of  the  angles  of  a  rectilinear  triangle), 
then  it  is,  as  such,  quite  superfluous  as  long  as  the  theory  is  maintained,  and  then  it 
must  in  all  further  computations  be  replaced  by  the  theoretically  given  value;  and  in 
the  same  way  ^2  (°»)  must  be  replaced  by  zero,  as  the  square  of  the  mean  error  on  the 
errorless  theoretical  value. 

The  only  possible  meaning  of  such  superfluous  observations  must  be  to  test  the 
correctness  of  the  theory  for  approbation  or  rejection  (a  third  result  is  impossible  when 
we  are  dealing  with  any  real  theory  or  hypothesis),  or  to  be  used  in  the  criticism. 

In  such  a  test  it  must  be  assumed  that  the  theoretical  value  corresponding  to  0B, 
which  we  will  call  WB,  is  identical  with  the  mean  value  in  the  law  of  presumptive  errors 
for  on,  consequently,  that  wre  =  ^1(0B),  and  the  condition  of  an  affirmative  result  must  be 
obtained  from  the  square  of  the  deviation,  (on  —  un)z  in  comparison  with  ^2  (on).  The 
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ei|iialinn    (</„      //,,)-        /.,("„)    n I    nnl    In-    r\ur1|\    Miiisiii-d,    but   the  approximation  must 

at  any  rate  l>r  so  rinse  thai   \\e  may  expect  in   liml  *.,(<>„)  coming  out  as  the  mean  of 
numerous  observed  values  of  (o,t       un)-.     Compare 

£  44.  If  then  all  the  observations  ot  ...  on  1'all  under  one  or  the  other  of  these  two 
cases,  I  lie  mailer  is  simple  enough.  But  generally  the  observations  o,  will  be  connected 
by  theoretical  equations  of  condition  which,  separately,  are  insufficient  for  the  determination 
of  the  single  ones.  Then  the  question  is  whether  we  can  transform  the  series  of  observations 
in  such  a  way  that  a  clear  separation  between  the  two  opposite  relations  to  the  theory 
can  be  made,  so  that  some  of  the  transformed  functions  of  the  observations,  which  must 
be  mutually  free  in  order  to  be  treated  as  unbound  observations,  become  quite  independent 
id'  I  lie  theory,  while  the  rest  are  entirely  dependent  on  it.  This  can  be  done,  and  the 
computation  with  observations  in  consequence  of  these  principles,  is  what  we  mean  bj| 
the  word  "adjustment". 

For  as  every  theory  can  be  fully  expressed  by  a  certain  number,  n  —  m,  of  theoretical 
equations  which  give  the  exact  values  of  the  same  number  of  mutually  independent  linear 
functions,  and  as  we  are  able,  as  we  have  seen,  from  every  observation  or  linear  function 
of  the  observations,  in  one  single  way,  to  separate  a  function  which  is  free  and  independent 
of  these  just  named  theoretically  given  functions,  and  which  must  thus  enter  into  another 
system,  represented  by  m  functions,  this  system  must  include  all  those  functions  of  the  ob- 
servations which  are  independent  of  the  theory  and  cannot  be  determined  by  it.  Each  of  the 
thus  mutually  separated  systems  can  be  imagined  to  be  represented,  the  theoretical  system  by 
•H—M,  the  non -theoretical  or  empirical  system  by  m  mutually  free  functions,  which  together 
represent  all  observations  and  all  linear  functions  of  the  same,  and  which  may  be  looked 
upon  as  a  complete,  transformed  system  of  free  functions,  consequently  as  unbound  obser- 
vations. The  two  systems  can  be  separated  in  a  single  way  only,  although  the  represen- 
tation of  each  partial  system,  by  free  functions,  can  occur  in  many  ways. 

It  is  the  idea  of  the  adjustment,  by  means  of  this  transformation,  to  give  the 
theory  its  due  and  the  observations  theirs,  in  such  a  way  that  every  function  of  the  theo- 
retical system,  and  particularly  the  n  —  m  free  representatives  of  the  same,  are  exchanged, 
each  with  its  theoretically  given  value,  which,  pursuant  to  the  theory,  is  free  of  error.  On 
the  other  hand,  every  function  of  the  empiric  system  and,  particularly,  its  m  free  representa- 
tives remain  unchanged  as  the  observations  determine  them.  Every  general  function  of 
the  n  observations  [do]  and,  particularly,  the  observations  themselves  are  during  the  adjust- 
ment split  into  two  univocally  determined  addenda:  the  theoretical  function  [d'o],  which 
should  have  a  fixed  value  D',  and  the  non-theoretical  one  \d"o~\.  The  former  [d'o]  is  by 
the  adjustment  changed  into  D'  and  made  errorless,  the  latter  is  not  changed  at  all.  The 
result  of  the  adjustment,  D'-\-[d"o\,  is  called  the  adjusted  value  of  the  function,  and  may 
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be  indicated  as  [du],  the  adjusted  values  of  the  observations  themselves  being  written 
u1...un.  The  forms  of  the  functions  are  not  broken,  as  the  distributive  principle  f(x-\-y) 
=  f(x)-{-f(y}  holds  good  of  every  homogeneous  linear  function. 

The  determination  of  the  adjusted  values  is  analogous  to  the  formation  of  the 
mean  values  of  laws  of  errors  by  repetitions.  For  theoretically  determined  functions  the 
adjusted  value  is  the  mean  value  on  the  very  law  of  presumptive  errors;  for  the  functions 
that  are  free  of  the  whole  theory,  we  have  the  extreme  opposite  limiting  case,  mean  values 
represented  by  an  isolated,  single  observation.  In  general  the  adjusted  values  [du]  are  ana- 
logous to  actual,  mean  values  by  a  more  or  less  numerous  series  of  repetitions.  For  while 
X^(\do\}  =  A2[d'o]-\-A»[d"o],  we  have  A2[du]  =  fls(D')i4M8[d"«]  =  A.2[d"o],  consequently 


smaller  than   hz  [do].    The  ratio    .  2^(,0\   is  analogous  to  the  number  of  the  repetitions  or 

A  j  I  ttWJ 

the  weight  of  the  mean  value. 

§  45.  By  "criticism"  we  mean  the  trial  of  the  —  hypothetical  or  theoretical  - 
suppositions,  which  have  been  made  in  the  adjustment,  with  respect  to  the  mean  errors  of 
the  observations;  new  determinations  of  the  mean  errors,  analogous  to  the  determinations 
by  the  square  of  the  mean  deviations,  //2,  will,  eventually  also  fall  under  this.  The  basis  of 
the  criticism  must  be  taken  from  a  comparison  of  the  observed  and  the  adjusted  values, 
for  instance  the  differences  [do]  —  [du].  According  to  the  principle  of  §  34  we  must  expect 
the  square  of  such  a  difference,  on  an  average,  to  agree  with  the  square  of  the  correspon- 
ding mean  error,  ^2  ([do]  —  [du]),  but  as  [do]  —  [du]  =  [d'o]  —  D',  and  Lz  [d'o]  = 

L2  ([do]  —  [du])  =  yl2  [do]  —  ^  [du] ,  (68) 

which,  by  way  of  parenthesis,  shows  that  the  observed  and  the  adjusted  values  of  the  same 
function  or  observation  cannot  in  general  be  mutually  free.  We  ought  then  to  have 

"i — m"  — i — TJ^I  ==  1  (69) 

/2  [do]  —  ^2  \du] 

on  the  average;  and  for  a  sum  of  terms  of  this  form  we  must  expect  the  mean  to  approach 
the  number  of  the  terms,  nota  bene,  if  there  are  no  bonds  between  the  functions  [do]— [du]', 
but  in  general  such  bonds  will  be  present,  produced  by  the  adjustment  or  by  the  selec- 
tion of  the  functions. 

It  is  no  help  if  we  select  the  original  and  unbound  observations  themselves,   and 

consequently  form  sums  such  as 

(o  —  u)2 


for  after  the  adjustment  and  its  change  of  the  mean  errors,  ^ll...un  are  not  generally 
free  functions  such  as  o1...on.  Only  one  single  choice  is  immediately  safe,  viz.,  to  stick 
to  the  system  of  the  mutually  free  functions  which,  in  the  adjustment,  have  themselves 
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represented  the  observations:  the  n  —  m  theoreticallly  given  functions  and  the  m  which 
the  adjustment  determines  by  the  observations.  Only  of  these  we  know  that  they  are  free 
both  before  and  after  the  adjustment.  And  as  the  differences  of  the  last-mentioned  m 
functions  identically  vanish,  the  criticism  must  be  based  upon  the  n  —  m  terms  corresponding 
to  the  theoretically  free  functions  [ao]  •=  A , . . .  [b'o]  =  B'  of  the  series 


•*•*•/ i  i  \L     VJ "  I \  U    ^  J I        i  i 

-  /j  j  ttU\  A  %  [O  0\  —  X2  [6  M]  r'''//  •.:  I  \y  v  *j  J 

the  sum  of  which  must  be  expected  to  be  =  w  —  m. 

Of  course  we  must  not  expect  this  equation  to  be  strictly  satisfied;  according  to 
the  second  equation  (46)  the  square  of  the  mean  error  on  1,  as  the  expected  value  of  each 
term  of  the  series,  ought  to  be  put  down  =  2 ;  for  the  whole  series,  consequently,  we  can 
put  down  the  expected  value  as  n  —  m  ±  1/2  (n  —  m). 

But  now  we  can  make  use  of  the  proposition  (66)  concerning  the  free  functions. 
It  offers  us  the  advantage  that  we  can  base  the  criticism  on  the  deviations  of  the  several 
observations  from  their  adjusted  values,  the  latter,  we  know,  being  such  a  special  set  of 
values  as  may  be  compared  to  the  observations  like  vt  ...vn  loc.  cit. ;  ul  ...un  are  only 
distinguished  from  vl  . .  .vn  by  giving  the  functions  which  are  free  of  the  theory  the  same 
values  as  the  observations.  We  have  consequently 

(\ao\-  A)*   .  .  ([b'o]  —  B')*        \(o  —  uY 


[aaJlt]  (OTJ          '  [   Jt  (o) 

If  we  compare  the  sum  on  the  right  side  in  this  expression  with  the  above   men- 
r     t0  _  w\2     i 

tioned        v         ~-  ,  which  we  dare  not  approve  on  account  of  the  bonds  produced  by 
L/2  (o)  —  x2  (u)\  r2i 

the  adjustment,  then  there  is  no  decided  contradiction  between  putting  down 


\     (o  _  w)2     1  "•      2' 

at  the  smaller  value  n  —  m  only,  while  i-r-4-T-*  -;—  r-d,  by  the  diminution  of  the  denomi- 

U»(«0  —  !?•(«)! 

nators,  can  get  the  value  n;  only  we  can  get  no  certainty  for  it. 

The  ratios  between  the  corresponding  terms  in  these  two  sums  of  squares,  conse- 
quently -?—          2v  '  =  1  —  T7\5  we  call  "scales",  viz.  scales  for  measuring  the  influence 

X2  (0)  X2  (0) 

of  the  adjustment  on  the  single  observation.    More  generally  we  call 

1  -   .2  r  ,  \  the  scale  for  the  function  [do].  (72) 

/2  L       J 

If  the  scale  for  a  function  or  observation  has  its  greatest  possible  value,  viz.  1, 
^2[c?w]==0.  The  theory  has  then  entirely  decided  the  result  of  the  adjustment.  But  if 
the  scale  sinks  to  its  lowest  limit  =  0,  we  get  just  the  reverse  /12  [du]  —  ^[do],  i.  e.  the 
theory  has  had  no  influence  at  all;  the  whole  determination  is  based  on  the  accidental 
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<Q M\2  Q 

value  of  the  observation,   and   for   observations  in    this  case   we  get  — ^r  -r  =  -. 

/2  (o)  —  ;2  (M)        0 

Even  though  the  scale  has  a  finite,   but  very   small   value  it   will  be  inadmissible  to   de- 
pend on   the   value   of  such  a  term   becoming  —  1.     We   understand  now,   therefore,  the 

(Q  WV21 

superiority   of  the   sum    of   the  squares  ;      =  n  —  m  to   the  sum   of  the  squares 


1  =  n   as  a  bearer  of  the  summary  criticism. 


We  may  also  very  well,  on  principle,  sharpen  the  demand  for   adjustment  on   the 

F/0  w)21 

part   of  the  criticism,   so  that  not  only  the  whole  sum   of  the   squares  must- 

approach  the  value  n  —  m,  but  also  partial  sums,  extracted  from  the  same,  or  even  its 
several  terms,  must  approach  certain  values.  Only,  they  are  not  to  be  added  up  as 
numbers  of  units,  but  must  be  sums  of  the  scales  of  the  corresponding  terms.  So  much 

we  may  trust  to  the  sum  of  the  squares    — y-r—     —j-^l   that  this  principle,   when  judi- 

L*2  (°)          *2  (U)\ 

ciously  applied,  may  be  considered  as  fully  justified. 

The  sum   of  the   squares  possesses   an  interesting  property  which   all 

other  authors  have  used  as  the  basis  of  the  adjustment,  under  the  name  of  "the  method  of 
the  least  squares".  The  above  sum  of  the  squares  gets  by  the  adjustment  the  least  possible 

170  #)21 

value  that     '          y      can  get  for  values  v1  ...vn  which  satisfy  the  conditions  of  the  theory. 

L    4s  (°)   ] 
The  proposition  (66)  concerning  the  free  functions  shows  that  the  condition  of  this  minimum 

is  that  [c"o]  =  [c"w],  . . .  [d'"o]  =  [d'"u]  for  all  the  free  functions  which  are  determined  by 
the  observations,  consequently  just  by  putting  for  each  v  the  corresponding  adjusted 
value  u. 

§  46.  The  carrying  out  of  adjustments  depends  of  course  to  a  high  degree  on  the 
form  in  which  the  theory  is  given.  The  theoretical  equations  will  generally  include  some 
observations  and,  beside  these,  some  unknown  quantities,  elements,  in  smaller  number  than 
those  of  the  equations,  which  we  just  want  to  determine  through  the  adjustment.  This 
general  form,  however,  is  unpractical,  and  may  also  easily  be  transformed  through  the 
usual  mathematical  processes  of  elimination.  We  always  go  back  to  one  or  the  other  of 
two  extreme  forms  which  it  is  easy  to  handle:  either,  we  assume  that  all  the  elements 
are  eliminated,  so  that  the  theory  is  given  as  above  assumed  by  n  —  m  linear  equations 
of  condition  with  theoretically  given  coefficients  and  values,  adjustment  by  correlates;  or, 
we  manage  to  get  an  equation  for  each  observation,  consequently  no  equations  of  condition 
between  several  observations.  This  is  easily  attained  by  making  the  number  of  the  elements 
as  large  (=  m)  as  may  be  necessary:  we  may  for  instance  give  some  values  of  observations 
the  name  of  elements.  This  sort  of  adjustment  is  called  adjustment  by  elements.  We 


71 

shall  discuss  these  two  forms  in  the  following  chapters  XI  and  XII,  first  the  adjustment  by 
correlates  whose  rules  it  is  easiest  to  deduce.  In  practice  we  prefer  adjustment  by  correlates 
m  is  nearly  as  large  as  w,  adjustment  by  elements  when  m  is  small. 


XL    ADJUSTMENT  BY  CORRELATES. 

§  47.  We  suppose  we  have  ascertained  that  the  whole  theory  is  expressed  in  the 
equations  [au\  ••=  A,  ...  [cu]  =  C,  where  the  adjusted  values  u  of  the  n  observations 
are  the  only  unknown  quantities;  we  prefer  in  doubtful  cases  to  have  too  many  equations 
rather  than  too  few,  and  occasionally  a  supernumerary  equation  to  check  the  computation. 
The  first  thing  the  adjustment  by  correlates  then  requires  is  that  the  functions  [ao]  ...  [co], 
corresponding  to  these  equations,  are  made  free  of  one  another  by  the  schedule  in  §  42. 

Let  [ao],  .  ..  [c"o]  indicate  the  n  —  m  mutually  free  functions  which  we  have  got 
by  this  operation,  and  let  us,  beside  these,  imagine  the  system  of  free  functions  completed 
by  m  other  arbitrarily  selected  functions  ,  [cT'o],  .  .  .  [/  0]  ,  representatives  of  the  empiric 
functions;  the  adjustment  is  then  principally  made  by  introducing  the  theoretical  values 
into  this  system  of  free  functions.  It  is  finally  accomplished  by  transforming  back  from 
the  free  modified  functions  to  the  adjusted  observations.  For  this  inverse  transformation, 
according  to  (62),  the  n  equations  are: 


(73) 


and  according  to  (35)  (compare  also  (63)) 

^2(o,-)  =  <    — *     g  •  >|2  [ao]  -j- -f-  -  —  ^st^1*0]^ 

llrtrt/2J  rfl'VyL]2 

J  (74) 


As  the  adjustment  influences  only  the  n—m  first  terms  of  each  of  these  equations, 
we  have,  because  [au]  =  A,  . . .  [c"u]  =  C",  and  >l2[aw]  ==...==  X2  [c"w]  =  0, 

*i  =  \r-a\^A  -f  •    •  +  r  ,,1'    -.  C"  +  rs£-  -^  [d'"o]  4- .  — - —  [a^li/ljo,)    (75) 

I [««^2 J  [c"c"^8]        r  [d'"d'";8] L  r  [/^2] 

and 

r        jniZ  r,v*      \ 

N-  (76) 
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Consequently 


.          \ao\-A  .  ,,[c"o]-C"\  .__. 

=  L>(0i}  a'         +  '  •  •  +  Ci 


and 

««,-«,).  (78) 


Thus  for  the  computation  of  all  the  differences  between  the  observed  and  adjusted  values 
of  the  several  observations  and  the  squares  of  their  mean  errors,  and  thereby  indirectly  for 
the  whole  adjustment,  we  need  but  use  the  values  and  the  mean  errors  of  the  several 
observations,  the  coefficients  in  the  theoretically  given  functions,  and  the  two  values  of 
each  of  these,  namely,  the  theoretical  value,  and  the  value  which  the  observations  would 
give  them. 

The  factors  in  the  expression  for  o,  —  w,-, 


[ao\-A  [c"o]-C" 

"'1 


which  are  common  to  all  the  observations,  are  called  correlates,  and  have  given  the  method 
its  name.  The  adjusted,  improved  values  of  the  observations  are  computed  in  the  easiest 
way  by  the  formula 

Ui  «  Oi  -  ^  (o<){aiKa  +  .  .  .  +  c?Kc»}.  (79) 

By  writing  the  equation  (78) 

-  ?     , 


and  summing  up  for  all  values  of  i  from  1  to  w,  we  demonstrate  the  proposition  concerning 
the  sum  of  the  scales  discussed  in  the  preceding  chapter,  viz. 


1 — T^V-r      =    r rM  -4-  .  .  •  -\~  "f-n-n-~4    =    W  —  7W. 


§  48.  It  deserves  to  be  noticed  that  all  these  equations  are  homogeneous  with 
respect  to  the  symbol  hz.  Therefore  it  makes  no  change  at  all  in  the  results  of  the 
adjustment  or  the  computation  of  the  scales,  if  our  assumed  knowledge  of  the  mean  errors 
in  the  several  observations  has  failed  by  a  wrong  estimate  of  the  unity  of  the  mean  errors, 
if  only  the  proportionality  is  preserved;  we  can  adjust  correctly  if  we  know  only  the 
relative  weights  of  the  observations.  The  homogeneousness  is  not  broken  till  we  reach  the 
equations  of  the  criticism  : 


-z      _L       , 
[aa>l2] 

f  ....  -f 


(82) 


It  follows  that  criticism  in  this  form,  the  "summary  criticism",  can  only  be  used  to  try 
the  correctness  of  the  hypothetical  unity  of  the  mean  errors,  or  to  determine  this  if  it 
has  originally  been  quite  unknown.  The  special  criticism,  on  the  other  hand,  can,  where 
the  series  of  observations  is  divided  into  groups,  give  fuller  information  through  the  sums 
of  squares 

^-Jf   _  W^^iM),  (83) 

L2  (o)  \         ^  (o) ) 

taken  for  each  group.  We  may,  for  instance,  test  or  determine  the  unities  of  the  mean 
errors  for  one  group  by  means  of  observations  of  angles,  for  another  by  measurements  of 
distances,  etc. 

The  criticism  has  also  other  means  at  its  disposal.  Thus  the  differences  (o  —  u) 
ought  to  be  small,  particularly  those  whose  mean  errors  have  been  small,  and  they  ought 
to  change  their  signs  in  such  a  way  that  approximately 

I  °f^  =  0  (84) 

^•2  N 

for  natural  or  accidentally  selected  groups,  especially  for  such  series  of  observations  as  are 
nearly  repetitions,  the  essential  circumstances  having  varied  very  little. 

If,  ultimately,  the  observations  can  be  arranged  systematically,  either  according  to ! 
essential  circumstances  or  to  such  as  are  considered  inessential,  we  must  expect  frequent  ' 
and  irregular  changes  of  the  signs  of  o  —  u.  If  not,  we  are  to  suspect  the  observations  of  1 
systematical  errors,  the  theory  proving  to  be  insufficient. 

§  49.  It  will  not  be  superfluous  to  present  in  the  form  of  a  schedule  of  the 
adjustment  by  correlates  what  has  been  said  here,  also  as  to  the  working  out  of  the  free 
functions.  We  suppose  then  that,  among  4  unbound  observations  o,,  o2,  o3,  and  o4,  with 
the  squares  on  their  mean  errors  ^2(°i)»  ^faz)'  ^^(°s)i  an(l  'UfaJ*  ^nere  ex^  relations 
which  can  be  expressed  by  the  three  theoretical  equations 

[au\  =  alul  -\-  «2W2  ~\~  asus  +  a4M4  ==  ^ 
[bu]  =  blu^  -|-  62M2  -|-  k.6u3  -(-  64  w4  i  •  B 
\cu}  =-  r t  MI  -f  cgtt8  -f-  caw8  +  c*ut  --  =  C. 

The  schedule  is  then  as  follows: 
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The  given 

Free  functions 

Adjusted  values                 Scales 

ABC 

.«'          C' 

C" 

o,  ;2(o.) 
o,  ;2(o2) 

[ao]     [fto]     [co] 

ft;     < 

ft; 
ft; 

^; 

i 

<>,-«,     «,    ^.(0,-tt.)    ^,(W,)      1-^,(M,)^,(0,) 

»,-«,   «,  ^,(0,-.,)  i,(«.)     l-i,(«,):i.(»,) 

=  3  as  proof. 

[a«;]  [aft;]  [ac;] 

Criticism 

[fta;]  [ft  ft;]  [bcX\ 

[ft'ft';]  [ftv;] 

(o,--?*,)2:;,^,) 

[n     r    z.  ^T     r        ^T 
/*  ft  A  \    \  f*  rt  J  \    1  /*  /*  j  1 
J    L        J    L        J 

[C'ft';]  [C'c';] 

[c"C»fl 

Correlates 

[fta;]        [c«;] 

t^'ft';] 

^,,_l 

03—  M3)2:;2(o3) 
(o.     u  )2  :  ;  (o  ) 

[ao]-^ 

ES-5' 

c,o]_c,            Sum  for  proof 

[aa;] 

[ft'ft';] 

rc"c";1            an<^  summary  criticism 

The  free  functions  are  computed  by  means  of: 


B'  =  B-fiA 

b'i  =  ft,-  —  fa 
\b'o]  =  [fto]-^[ao] 

C'  =  (7-r^ 

C,'   =    C,-  —  fdi 

[c'o]  =  [co]-r[ao] 

C"  =  C'  -r'B' 

&  =  c;-r'ftj 

[c"o]  ==  [c'o]-r'[ft'o] 

'cfb'X]  =  [c6>l]-^[ca;] 

[Cv;]  =  [cc;]-r[ca;] 

[cv/ij  =  [cv;]-r'[c'ft';] 

By  the  adjustment  properly  so  called  we  compute 


Ot  —  wt-  =  (aiKa  -f-  ft^  +  Ci'Kc»)  ;2  (o,-) 


(a,-2 
p 


ftj2 


and  for  the  summary  criticism 


n— 


=  3±I/6 


In  order  to  get  a  check  we  ought  further  to  compute  [aw]  =  A,  [bu\  =  B,  and 
[CM]  =  (7,  with  the  values  we  have  found  for  wn  w2,  w3,  and  M4.  Moreover  it  is  useful  to 
add  a  superfluous  theoretical  equation,  for  instance  [(a-j-ft-(-c)w]  =  A-\-B-\-C,  through  the 
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computation  of  the  free  functions,   which  is   correct  only   if  such   a  superfluity  leads  to 
identical  results. 

§  50.  It  is  a  deficiency  in  the  adjustment  by  correlates  that  it  cannot  well  be 
employed  as  an  intermediate  link  in  a  computation  that  goes  beyond  it.  The  method  is 
good  as  far  as  the  determination  of  the  adjusted  values  of  the  several  observations  and 
the  criticism  on  the  same,  but  no  farther.  We  are  often  in  want  of  the  adjusted  values 
with  determinations  of  the  mean  errors  of  certain  functions  of  the  observations;  in  order 
to  solve  such  problems  the  adjustment  by  correlates  must  be  made  in  a  modified  form. 
The  simplest  course  is,  I  think,  immediately  after  drawing  up  the  theoretical  equations  of 
condition  to  annex  the  whole  series  of  the  functions  that  are  to  be  examined,  for  instance 
frfo], . . .  [eo],  and  include  them  in  the  computation  of  the  free  functions.  In  doing  so  we 
must  take  care  not  to  mix  up  the  theoretically  and  the  empirically  determined  functions, 
so  that  the  order  of  the  operation  must  unconditionally  give  the  precedence  to  the 
theoretical  functions;  the  others  are  not  made  free  till  the  treatment  of  these  is  quite 
finished.  The  functions  [eT'o],  . . .  \evo\ ,  which  are  separated  from  these  -  -  it  is  scarcely 
necessary  to  mention  it  —  remain  unchanged  by  the  adjustment  both  in  value  and  in 
mean  error.  And  at  last  the  adjusted  functions  [rfw],  . . .  [ew],  by  retrograde  transformation, 
are  determined  as  linear  functions  of  A,  B',  C",  [cT'o],  . . .  \#o\. 

Example  1.  In  a  plane  triangle  each  angle  has  been  measured  several  times,  all 
measurements  being  made  according  to  the  same  method,  bondfree  and  with  the  same 
(unknown)  mean  error: 

for  angle  A  has  been  found  70°  0'  5"  as  the  mean  number  of  6  measurements 
»       »      B    »       »        »      50°  0'  3"   »      »       »  »       » 10  » 

»       »       C    »       •>         »      60°  0'  2"    »      »       »  »       »  15 

The  adjusted  values   for   the  angles  are  then  70°,  50°,  and  60°,  the  mean  error  for  single 
measurement  =  1/300  =  17"3,  the  scales  0-5,  0-3,  and  0-2. 

Example  2.  (Comp.  example  §  42.)  Five  equidistant  tabular  values,  12,  19,  29, 
41,  55,  have  been  obtained  by  taking  approximate  round  values  from  an  exact  table,  from 
which  reason  their  mean  errors  are  all  =  J/^L..  The  adjustment  is  performed  under  the 
successive  hypotheses  that  the  table  belongs  to  a  function  of  the  3rd,  2nd,  and  Ist  degree, 
and  the  hypothesis  of  the  second  degree  is  varied  by  the  special  hypothesis  that  the  2nd 
difference  is  exactly  =  2,  in  the  following  schedule  marked  (or).  The  same  schedule  may 
be  used  for  all  four  modifications  of  the  problem,  so  that  in  the  sums  to  the  right  in  the 
schedule,  the  first  term  corresponds  to  the  first  modification  only,  and  the  sum  of  the 
two  first  terms  to  the  second  modification: 

10* 
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12 

19 
29 
41 
55 


i 

12 
1 
1  2 

1 
12 

1 
12 

1 
12 


J4 

0 

FJ3 
0 

J2 

0  (or  2) 

(FJ3)' 
0 

(J2)'=(J2)" 

0  (or  2) 

1 

0 

0 

i 

2 
7 

_4 

-1 

1 

1 

-i 

6 

3 

2 

0 

2 
~f 

-4 

-3 

1 

1 

7 

1 

1 

0 

i 

2 

2 

T 

1 

0 

2 

1 

2 

1  6 

7 

70 

35 

20 

1  2 

12 

12 

35 
1  2 

20 
12 

10 
12 

5 

2  4 

0 

20 
1  2 

1  0 
'12 

6 
12 

0 

1 

4  2 

/N 

_  J, 

T  :==      7                 T 

'  =  0 

T2#<. 

===  7~0 

YY                ^'  T¥J 

<>=8(orl) 

0  — 


1+  7+160  (or +20)), 
4_14-  80  (or -10)), 
6+  0-160  (or -20)), 
4+14_  80  (or -10)), 
1-  7+160  (or +20)), 


/,z(o—u) 


^(  1+  7+20) 
^(16+28+  5) 
^(36+  0+20) 
8-^(16+28+  5) 

8lo(  1+  7+2°) 


For  the  summary  criticism: 

7680  (or  120) 


__    " i         i 

"  35^35^ 


35 


The  hypothesis  of  the  third  degree^  J4  =  0,   where  the  values  of  70 ut  and  their 

differences  are: 

839  1334  2024  2874  3849 

495   690   850   975 
195   160   125 

-35  -35, 

agrees   too  well  with   the  observations,  and  must  be  suspected  of  being  underadjusted,  for 
the  sum  of  the  squares  of  the  summary  criticism  is  only 

^-,  where  we  might  expect  1  +  1/2. 
The   hypothesis   of  the   second    degree,   J4  =0,    FJ3  =  0,  gives    for    70 w,  and 

differences: 

832  1348  2024  2860  3856 

516   676   836   996 

160      160      160. 
The  adjustment  is  here  good,  the  sum  of  the  squares  is 

•||,  and  we  might  expect  2+1/4. 
The  hypothesis  of  the  first  degree,  J4  —  0,  T2/3  =  0,  J2  =  0,  gives  for  the  adjusted 

values  and  their  differences: 

9-6  20-4  31-2  42-0  52-8 

10-8  10-8  10:8  10-8. 
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The  deviations  are  evidently  too  large  (o-u  is  -f-2-4,  —1-4,  —2-2,  —1-0,  -f2-j?) 
to  be  due  to  the  use  of  round  numbers;  the  sum  of  the  squares  is  also 

±>0-8  instead  of  3^1/6, 
consequently,  no  doubt,  an  over-adjustment. 

The  special  adjustment  of  the  second  degree,  J4  =  0,  VA*  =  0,  and  J*  =  2,  gives 

for  «,  and  its  differences: 

11-6     19-4    29-2    41-0    54-8 

7-8      9-8      11-8     13-8 
The  deviations  o  -  u  =  0-4,  -0-4,  -0-2,  O'O,  -(-0-2 

nowhere  reach  £,  and  may  consequently  be  due  to  the  use  of  round  numbers;  the  sum  of 

the  squares 

4-8  instead  of  3^1/6 

also  agrees  very  well.  Indeed,  a  constant  subtraction  of  O04  from  «,  would  lead  to 
(3-4)2,  (4-4)2,  (5-4)2,  (6-4)2,  and  (7-4)2,  from  which  the  example  is  taken. 

Example  3.    Between  4  points  on  a  straight  line  the  6  distances 

°12  '       °1 3  '       °14 

°23>       °24 

°34 

are  measured  with  equal  exactness  without  bonds.     By  adjustment  we  find  for  instance 

^12  =  io12  +  i(0i3-o23)  +  {(o14-o24); 

we  notice  that  every  scale  =  |.  It  is  recommended  actually  to  work  the  example  by  a 
millimeter  scale,  which  is  displaced  after  the  measurement  of  each  distance  in  order  to 
avoid  bonds. 


XII.     ADJUSTMENT  BY  ELEMENTS. 

§  51.  Though  every  problem  in  adjustment  may  be  solved  in  both  ways,  by 
correlates  as  well  as  by  elements,  the  difficulty  in  so  doing  is  often  very  different.  The 
most  frequent  cases,  where  the  number  of  equations  of  condition  is  large,  are  best  suited 
for  adjustment  by  elements,  and  this  is  therefore  employed  far  oftener  than  adjustment 
by  correlates. 

The  adjustment  by  elements  requires  the  theory  in  such  a  form  that  each  observa- 
tion is  represented  by  one  equation  which  expresses  the  mean  value  /I  t  (o)  explicitely  as 
linear  functions  of  unknown  values,  the  "elements",  x,  y,  ...  z : 
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(85) 


where  the  /? ,  g ,  ...  r  are  theoretically  given.  All  observations  are  supposed  to  be 
unbound. 

The  problem  is  then  first  to  determine  the  adjusted  values  of  these  elements 
x,  i/,  ...  z,  after  which  each  of  these  equations  (85),  which  we  call  "equations  for  the 
observations'",  gives  the  adjusted  value  u  of  the  observation. 

Constantly  assuming  that  ^2  (o)  is  known  for  each  observation,  we  can  from  the 
system  (85)  deduce  the  following  normal  equations: 


(o) 


MO] 

_qp_ 

;2(o; 


*4- 


*  + 


w 


+  •••  + 
+  ...+ 


po 


*i-+Ea3.^'-'.-+i 


rr 

^>j 


ro 


(86) 


the  rule  of  formation  being  apparent  from  the  left  hand  terms.  Of  these  normal  equations 
we  can  prove,  first  that  they,  m  in  number,  are  suited  for  the  determination  of  the  m 
elements,  so  far  as  these,  on  the  whole,  can  be  determined  by  the  equations  (85),  and 
then  that  the  functions  of  the  observations,  which  form  their  left  hand  terms  are  free  of 
all  the  theoretical  conditions  of  the  problem,  so  that,  as  indicated  by  the  last  sign  of 
equality  in  the  normal  equations,  they  can  and  must  be  determined  by  the  directly 
observed  values  o:  ...  on. 

For  if  we  assume,  as  to  the  first  proposition,  that  any  of  the  normal  equations 
can  be  deduced  from  the  others,  so  that  all  the  elements  cannot  be  determined  by  these 
equations,  then  there  must  be  m  coefficients  h,  k,  ...  /,  so  that 


PP]   i   j.  F2P] 

T]  f  [yj 


*'¥] 


7!) 

(/I  everywhere  used  for  ^,(0));  but  if  we  multiply  these  again  respectively  by  A,  h,  .  .  .  I  and 
add,  we  get 


MO) 

that  is 

hp{  -f-  %  -f  .  .  .  -h  to  =  0  « 

so  that  not  only  the  normal   equations,  but  the  very  equations  for  the  observations  can, 
consequently,  all  be  written  with  m  —  1  or  a  smaller  number  of  elements. 

But  further,  the  system  of  functions  represented  by  the  normal  equations  is  free 
of  every  one  of  the  conditions  of  the  theory.  The  latter  we  can  get  by  eliminating  the 
elements  x,  y,  ...  z  from  the  equations  of  the  observations  (85).  But  elimination  of  an 
element,  say  for  instance  x,  leads  to  the  functions  p./ljo*)  —  jO*^i(0,),  and  among  the 
linear  functions  of  these  must  be  found  the  functions  from  which  not  only  x  but  all  the 
other  elements  are  eliminated,  and  consequently  the  conditional  equations  of  the  theory. 
But  it  is  easily  seen  that  the  functions 

and 


are  mutually  free.   The  latter  is  the  left  hand  side  of  the  normal  equation  which  is  parti- 
cularly  aimed  at  the  element  x;   it  is   formed   by  multiplying  the  equations  (85)  by  the 


coefficient  of  x  in  each,  and  has  the  sum  of  the  squares  Ky-     as  the  coefficient  of  this 

L  *  J 
element;  it  has  thus  been  proved  to  be  free  of  all  the  conditions  of  the  theory,  and  must 

therefore  in  the  adjustment  be  computed  by  the  directly  observed  values,  for  which  reason 

r  p0  i 

we  have  been  able  in  the  equations  (86)  to  rewrite  the  function  as  l-i^WI-  In  the  same 

[^(°)\ 

way  we  prove  that  all  the  other  normal  equations  are  free  of  the  theory,  each  through 

the  elimination  from  (85)  of  its  particularly  prominent  element.  While,  in  the  adjustment 
by  correlates,  we  exclusively  made  use  of  the  equations  and  functions  of  the  theory,  we 
put  all  these  aside  in  the  adjustment  by  elements,  in  order  to  work  only  with  the  empiri- 
cally determined  functions  which  the  normal  equations  represent. 

The  coefficients  of  the  elements  in  the  normal  equations  are,  as  it  will  be  seen, 
arranged  in  a  remarkably  symmetrical  manner,  and  each  of  them  has  a  significance  for 
the  problem  which  it  is  easy  to  state. 

The  coefficients  in  the  diagonal  line,  which  are  respectively  multiplied  by  the 
element  to  which  the  equation  particularly  refers,  are  as  sums  of  squares  all  positive,  and 
each  of  them  is  the  square  of  the  mean  error  for  that  function  of  the  observations  in 
whose  equation  it  occurs.  We  have  for  instance 

\PP]  ..\PP 
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The  coefficients  outside  the  diagonal  line  are  identical  in  pairs,  the  coefficient  of  a?, 
~-\  in  «/'s  particular  equation,  is  the  same  as  the  coefficient  of  y,    ^    in   x's   particular 

equation.   They  show  immediately  if  some  of  the  functions  *3|,  l~  ,  . . .    —   should  happen 

rp0i    Ml    M  J          I  x  J  r     -I 

to  be  mutually  free;   if  for  instance  o?'s  function    £-1  is  to  be  free  of  y's  function     -  -  , 

r  n         n  1  \inn\  L  *  J  \-  *  J 


we  must  have   1 4- .  4-  •  **(o)\  =  1^1  =  0. 


§  52.  If  now  the  elements  have  been  selected  in  such  a  convenient  way  that  all 
these  sums  of  the  products  vanish,  and  the  normal  equations  consequently  appear  in  the 
special  form 


^  L.  F> 

rr          -  u 


rr  ro 

I   ^     I 

T0       T 


(87) 


then  they  offer  us  directly  the  solution  of  the  problem  of  adjustment.  The  adjusted  values 
for  the  elements  are 

*•«'••;> •  -Hfl<HfHf]<--[THT]<       (88) 

and  the  squares  of  the  mean  errors 

r»»']-1  Foal-1  OH-1 

/2(^)  -    f      ,    /2(2/)  -    f      ,   -..   /2(z)  =  ,  (89) 

L/J  LAJ  L7'] 

and  from  these  we  can  then  compute  both  the  adjusted  value  and  its  X2  for  every  linear 
function  of  the  elements,  because  these  are  mutually  free  functions.  In  particular  from 
the  equations  (85), 

we  can  compute  the  adjusted  values  u(  of  the  observations,  then  from  (35)  the  squares  of 
the  mean  errors  x^(wt.),  and  also  the  law  of  errors  for  every  function  of  observations  and 
elements. 

§  53.  In  ordinary  cases  a  transformation  of  the  system  of  elements  is  required. 
It  is  required  for  the  solution  of  the  normal  equations  in  order  to  find  the  values  of  the 
elements;  but  we  must  remember  that  we  have  here  a  double  problem,  as  it  is  also  our 
object  to  free  the  transformed  elements  so  that  they  may  be  used  for  determinations  of 
the  mean  errors.  The  transformation  therefore  cannot  be  selected  so  arbitrarily  as  in 
analogous  problems  of  pure  mathematics;  yet  there  is  a  multiplicity  of  possibilities,  and 
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in  many  special  cases  radical  changes  can  lead  to  very  beautiful  solutions  (see  §62).  The 
first  thing,  however,  is  to  secure  a  method  which  may  be  always  applied;  and  this  must 
be  selected  in  such  a  way  that  the  elements  are  eliminated  one  by  one,  so  that  the  later 
computation  of  them  is  prepared,  and  moreover,  constantly,  in  such  a  way  that  freedom 
is  attained. 

This  can,  if  we  commence  for  instance  by  eliminating  the  element  x,  be  attained 
in  the  following  way.    The  normal  equation  which  particularly  refers  to  x, 

i'1'":";':"!1  (90) 

and  which  will  be  put  aside  to  be  used  later  on  for  the  computation  of  x,  is  multiplied 

by  such  factors,  viz.  <p  =    ?  :  p?L  •  •  •  a>  =  rrl :  r?"| '  that  x  vanisnes  wnen  the 

MJMJ  L  *  J  •  I  *  J 

products  are  respectively  subtracted  from  the  other  normal  equations;  but  it  must  be 
remembered  that  we  are  not  allowed  to  multiply  the  latter  by  any  factor.  The  equation 
for  x  can  then  be  written 


[V'D\~i 
'     '     " 

The  functions  in  the  other  equations 

~po 
1 

become,  by  this  means,  not  only  independent  of  x  but  also  free  of  |^|   or  of  £,  for 

L 

P     2~! 


The  equations  which  in   a  double  sense  have  been   freed  from  x,  get  exactly  the 
same  characteristic  functional  form  as  the  normal  equations  had.    If  we  write 


so  that  the  equations  for  the  observations  become 


we  not  only  get,  as  we  see  at  once, 


q'o]          \qo]  \po]  \r'o]          fro]  \po] 

~*    .....     -—  T-         <93) 


11 


but  also 


qp 


q'r' 


r'q' 


rr\  \rp 

7    "  1 


(94) 


Hence  we  proceed  exactly  in  the  same  way  from  this  first  stage  of  the  transformation   of 
the  normal  equations 


(95) 


using,  for  instance,  the  first  of  them  for  the  elimination  of  the  element  y.     If 

q'q' 


y  is  replaced  by 

.  l  4.         4.    iz  =.  [L_       

7  =  I  A  J  '  I  ^ 

which  is  free  of  the  element  £,  and  for  which  we  have 


(96) 


(97) 
By  means  of  cat  and  corresponding  coefficients  we  have,  analogously  to  (93)  and  (94), 


r"o 
A 


q'o]  |  r"r' 

T  ,  .... 


which  are  independent  of  any  special  computation  of  the  coefficients  r". 

Continuing  in  this  way,  till  we  have  obtained  a  set  consisting  only  of  free  func- 
tions, we  find,  consequently,  just  a  system  of  elements,  £,  37,  C,  which  possess  the  above- 
mentioned  desired  property,  its  normal  equations  being  of  the  same  form  as  (87),  viz.: 

po 


s. 

A 


c-P 

I  \          I  x 


(98) 
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With    these   elements    the  equations   for  the  adjusted   values    of  the  several   observations 
become 


and  for  the  squares  of  their  mean  errors 


If  we  want  to  compute  adjusted  values  and  mean  errors  for  the  original  elements  or  func- 
tions of  the  same,  the  means  of  so  doing  is  given  by  the  equations  of  transformation 


y  +  .  .  .  -j-  «>z  = 


=  C 


(101) 


or  by  (90),  the  first  equation  (95)  and  the  last  of  (98),  being  identical  with  (101).  For  not 
only  the  original  elements  x,  y,  ...  z  are  easily  computed  by  these,  but  also  the  coef- 
ficients in  the  inverse  transformation 


y- 


7+.  ..+<JC 


(102) 


Now,  if  F  is  a  given  linear  function  of  x,  y,   ...  z  ,  then  by  obvious  numerical 
operations  we  get  an  expression  for  it, 

F  «  a£+6 
and  for  the  square  of  its  mean  error  we  get 

wttl-1  oo 

+ 


If  for  special  criticism  we  want  the  computation  of  ^2(w,)  for  many  observations, 
we  may  take  advantage  of  transforming  the  equations  of  observations,  computing  their 
coefficients  by  (92),  or 


r— 


but  we  remember  that  g',  .  .  .  r"  are  quite  superfluous  for  the  coefficients  of  (95). 

§  54.  In  the  theory  of  the  adjustment  by  elements  we  must  not  overlook  the 
proposition  concerning  the  computation  of  the  minimum  sum  of  squares  for  the  benefit  of 
the  summary  criticism  as  well  as  for  checking  our  computation.  We  are  able  to  compute 


-   , 
J 


which  is   to  approach  the  value  n  —  m,   as   soon   as  we  have  found 


only  the   elements,  without  being  obliged   to  know  the  adjusted  values  for  the  separate 
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observations.  And  this  computation  can  be  performed,  not  only  for  the  legitimate  adjust- 
ment, but  for  any  values  whatever  of  the  elements.  It  is  easiest  to  show  this  for  trans- 
formed elements,  ^,37,,  •••  d-  The  values  for  the  observations  corresponding  to  these 
must  be  computed  by  (99) 

».f    4-  q'.-n    -J-  .  .  .  4-  rv.  C     =  v. . 
From  this  we  get 

(103) 


-i  r     f     f-i  r     Vrt*Vl 

T  K2  +  nr  h?'2+-"  +       ~K'2' 

*  J  L  x  J  L  *  J 


If  we  here  substitue  for    — -  ,    ^- 1,    ...   \—r\  their  values  in  terms  of  the  elements 

I  *  J     M  J  L  *J 

77,   ...   £,  of  the  legitimate  adjustment,  we  find  from  the  equations  (98) 


It  is  evident  from  this  that  the  condition  of  minimum  is  ft  =  f ,  57,  =37,  Ci  =  C-  The 
minimum  sum  of  squares  is  therefore  obtained  only  by  the  determination  of  the  functions 
that  are  free  of  the  theory,  by  means  of  their  directly  observed  values.  And  for  this 
minimum 

(o-w)2 


(106) 


10  r       f       -\  n  f~       I/      "I  9 

I  go  I2  Tr^ol2 

m  m  HOT) 

....  •  \-^^  / 

q'q'] 


It  deserves  to  be  noticed  that  the  middle  one  of  these  expressions  holds  good,  in  unchanged 
form,  also  of  the  original,  not  transformed  elements  and  coefficients.    We  have 

•<*''  "   f (t^!l  =  -1  _.  M  ,  _  fel  „  _  . . .  _  M  . ,  (108) 

L  **  (°)  J        ^  J     U  J       L  ^  J  I  /  J 

which  is  easily   proved    by   substituting  in   (106)   the  values   obtained    from    (101).    The 
equation  is  particularly  valuable  as  a  check  on  the  accuracy  of  our  computation. 

§  55.    In  going  through  the  theory  of  adjustment  by  elements  here   developed,  it 
will  be   seen  that  a  very  essential   part  of  the  work,  viz.  the  computation  of  the  trans- 
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formed  values  of  the  coefficients  in  the  equations  for  the  several  observations,  may  nearly 

always  be  dispensed  with.  The  sums  of  the  squares,  ^  ,  and  the  sums  of  the  products, 
•  1 "  L  *  J 

ill    must  be  transformed;  but  they  are  in  themselves  sufficient  for  the  determination  of 
I  *  J 
the  transformations,  and   by  their  help  we  find  values  and  mean  errors  for  the  elements, 

first  the  transformed  ones,  but  indirectly  also  the  original  ones.  The  adjusted  values 
M,  . . .  un  of  the  observations  can,  consequently,  also  be  computed  without  any  knowledge 
of  q'.  . . .  rj  . . .  r\.  Only  for  the  computation  of  ^2  (w,)  ...  ^  (un) ,  consequently  for  a 
special  criticism,  we  cannot  escape  the  often  considerable  work  which  is  necessary  for 

the  purpose. 

r(o_w)2i  

For  the  summary  criticism  by  ;      =  n  —  m±V2(n— w),  we  can  even,  as 

we  have  seen,  dispense  with  the  after-computation  of  the  several  observations  by  means 
of  the  elements.  We  ought,  however,  to  restrict  the  work  of  adjustment  so  far  only,  when 
the  case  is  either  very  difficult  or  of  slight  importance,  for  this  minimum  sum  of  squares 
is  generally  computed  much  more  sharply,  and  always  with  much  greater  certainty,  directly 
by  o,-,  w,,  and  ;,(o),  than  by  the  formulae  (105),  (106),  and  (107). 

Add  to  this,  that  the  special  criticism  does  not  exclusively  rest  on  >}2  (M,)  and  the 

scales  1-     2       ,   but  that  the  very  deviations  o,  — w,,  when  they  are  arranged  according 

to  the  more  or  less  essential  circumstances  of  the  observations,  are  even  a  main  point  in 
the  criticism.  Systematical  errors,  especially  inaccuracies  or  defects  in  hypotheses  and 
theories,  will  betray  themselves  in  the  surest  and  easiest  way  by  the  progression  of  the 
errors;  regular  variation  in  o  —  u  as  a  function  of  some  circumstance,  or  mere  absence  of 
frequent  changes  of  signs,  will  disclose  errors  which  might  remain  hidden  by  the  check 

according  to   2*     ~        =  2*  ( 1  —  ,   ,  ;  j ;   and  such   progression  in  the  errors  may,  we 

^2(0)  \        *z(o)  i 

know,  even  be  used  to  indicate  how  we  ought  to  try  to  improve  the  defective  theory. 

§  56.  By  series  of  adjustment  (compare  Dr.  J.  P.  Gram,  Udjevningsraekker,  Kjeben- 
havn  1879,  and  Crelle's  Journal  vol.  94),  i.  e.  where  the  theory  gives  the  observations  in 
the  form  of  a  series  with  an  indeterminate  (infinite)  number  of  terms,  each  term  being 
multiplied  by  an  unknown  factor,  an  element,  and  where  consequently  adjustment  by 
elements  must  be  employed,  the  criticism  gets  the  special  task  of  indicating  how  many 
(or  which)  terms  of  the  series  we  are  to  include  in  the  adjustment.  Formula  (107)  fur- 
nishes us  with  the  means  of  doing  this. 

r-ol2 

[1^11    =   fc]-v_ii_M_m±^2F^). 
I  *2  (o)  [  *  J  \rvrn 
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For  the  m  terms  in  the  series,  which  is  here  indicated  by  21,  correspond,  each  of  them,  to 
an  element,  consequently  to  one  of  the  terms  of  the  series  of  adjustment.  For  each  term 
we  take  into  this,  the  right  side  of  the  equation  of  criticism  is  diminished  by  about  a 
unity;  the  result  of  the  criticism,  consequently,  becomes  more  favourable  if  we  leave  out 

\rvo^^   [W1'!-1 
all  the  terms  for  which    — —    •  — ;—     <  1.    If  we  retain  any  terms  which  essentially  fall 

I  *  J     L  *   J 
under  this  rule,  the  adjustment  becomes  an  under-adjustment;   if,  on  the  other  hand,  we 

[Vol2    \rvrv]~l 
leave  out   terms  for  which     -—    •    — r—      >  1 ,  we   make  ourselves    guilty  of  an   over- 

L  *  J     I  *   J 
adjustment. 

Example  1.  The  five-place  logarithms  in  a  table  are  looked  upon  as  mutually 
unbound  observations  for  which  the  mean  error  is  constantly  l/TV  °f  the  fifth  decimal 
place.  The  "observations",  log  795,  log  796,  log  797,  log  798,  log  799,  log  800,  log  801, 
log  802,  log  803,  log  804,  and  log  805,  are  to  be  adjusted  as  an  integral  function  of  the 
second  degree 

log  (800  4-0  = 


In  order  to  reckon  with  small  integral  numbers,  we  subtract  before  the  adjustment 
2-90309  -f  0-00054  *,  both  from  the  observations  and  from  the  formulae.  Taking  0-00001  as 
our  unity,  we  have  then  the  equations  for  the  observations: 


_2  =  x  — 
_2  =  a  —  4*/  -fl60 
-1  =  x  —  3*/  +  9z 
-  1  =  x  —  2  40 


0  =  x 

0  =  a;  -f  1  ?/  4-  10 

0  =  x  +  2y  +  40 

1  =  z  +  s^-L  g^ 

1  =  x  -f-  4«/  -}-  160 

1  =  a  +  5^  -|-  250. 


From  this  we  get  \-r-\  =156,  and  the  normal  equations: 
L  ^  J 


•    36  =     132  a?  +       Oy+    13200 

420  =        0#  +  1320y+         00 

-540  =  1320ic+       0^  +  234960. 


The  element  y  is  consequently  immediately  free  of  x  and  0,  but  .the  latter  must  be  made 


87 


free  of  one  another,  which  is  done  by  multiplying  the  first  equation  by  10  and  subtracting 


it  from  the  third.    The  transformation  into  free  functions  then  only  requires 
substituted  for  #,  and  we  have: 


+  10 


-   36  = 

420  = 

—  180  = 


consequently, 


£  =  -0-2727, 
y  =.  0-3182 , 
z  =  -0-0175, 


132  f, 

1320  y, 

102960, 

132  =  390 
1320  =    39 


51480 
51480 


=  1 : 10296 


5 : 51480 


•007576 
•000758 
•000097. 


The  mean  error  of  y  is  consequently  -{-0-0275,  and  that  of  z  i  0-0099.    The  element  x 
is  found  by  x  =  £— 10  2  =  —0-0977,  to  which  corresponds  >12  (x)  =  /12(£)  +  100>12(0) 
=  0-0173  =  (0-1315) 2.     For  log  800   we    find    thus   2-9030890  ±  0-0000013,   and    the 
corresponding  difference  of  the  table  is  54-318  +  0-028. 

For  the  sum  of  the  squares  of  the  deviations  we  have,  according  to  (105)— (107), 


=  156  — 9-82-133-64  — 3-15  =  9-39, 


which  shows  that  the  term  of  the  second  degree  contributes  somewhat  to  the  goodness  of 
the  adjustment.  This  sum  of  squares  ought,  according  to  the  number  of  the  observations 
and  the  elements,  to  be  11  —  3  =  8,  with  a  mean  uncertainty  of  +  4. 

The   best  formula  for   computing  the  adjusted  values   of  the  several  observations 
and  their  mean  errors  is  w,  =  ^-\-yt-\-z(tz— 10),  which  gives: 


log  795 

=  2* 

u 
9003688 

0—  U 

+  •12 

(o—  u)* 
•0144 

390 

+  39 

•  25  +  5 

•  225  = 

2490 

Ji(«) 

•0484 

Scale 
•419 

log  796 

=  2-9009136 

-•36 

•1296        390 

+  39 

•16  +  5 

•    36  = 

1194 

•0232 

•722 

log  797 

=  2- 

9014580 

+  •20 

•0400        390 

+  39 

•   9  +  5 

1  — 

746 

•0145 

•826 

log  798 

o. 

9020019 

-•19 

•0361        390 

+  39 

•    4  +  5 

•    36  = 

726 

•0141 

•831 

log  799 

=  2- 

9025457 

+  •43 

•1849 

390 

+  39 

•    1+5 

•   81  = 

834 

•0162 

•806 

log  800 

=•  2- 

9030890 

+  •10 

•0100 

390 

+  39 

•0  +  5 

•100  = 

890 

•0173 

•792 

log  801 

=  2- 

9036321 

-•21 

•0441 

390 

+  39 

•    1  +  5 

•   81  = 

834 

•0162 

•806 

log  802 

=  2- 

9041747 

-•47 

•2209 

390 

+  39 

•   4  +  5 

•    36  = 

726 

•0141 

•831 

log  803 

o. 

9047170 

+  •30 

•0900 

390 

+  39 

•   9  +  5 

•     1  - 

746 

•0145 

•826 

log  804 

=  2- 

9052590 

+  •10 

•0100        390 

+  39 

•  16  +  5 

•   36  = 

1194 

•0232 

•722 

log  805 

=  2- 

9058006 

-•06 

•0036        390 

+  39 

•  25  +  5 

•  225  = 

2490 

•0484 

•419 

•7836 


12870 


8-000 


Both  the  checks  agree:  the  sum  of  squares  is  12x0-7836  =  9-40,  and  the  sum 
of  the  scales  is  11  —  3. 

It  ought  to  be  noticed  that  the  adjustment  gives  very  accurate  results  throughout 
the  greater  part  of  the  interval,  with  the  exception  of  the  beginning  and  the  end.  The 
exactness,  however,  is  not  greatest  in  the  middle,  but  near  the  1st  and  the  3rd  quarter. 

Example  2.  A  finite,  periodic  function  of  one  single  essential  circumstance,  an  angle  F, 
is  supposed  to  be  the  object  of  observation.    The  theory,  consequently,  has  the  form  : 
ov  =  c0-f  cl  cos  F-J-S!  sin  V-\-cz  cos2F-|-  s2sin  2F-J-  .  .  . 


We  assume  that  there  are  n  unbound,  equally  exact  observations  for  a  series  of  values  of  F, 
whose  difference  is  constant  and  =  ~,  for  instance  for  F=  0,  60°,  120°,  180°,  240°,  300°. 
Show  that  the  normal  equations  are  here  originally  free,  and  that  they  admit  of  an  exceedingly 
simple  computation  of  each  isolated  term  of  the  periodic  series. 

Example  3.  Determine  the  abscissae  for  4  points  on  a  straight  line  whose  mutual 
distances  are  measured  equally  exactly,  and  are  unbound.  (Cmp.  Adjustment  by  Correlates, 
Example  3,  and  §  60). 

Example  4.    Three  unbound  observations  must,  according  to  theory,  depend  on  two 

elements,  so  that 

ot  =  a?2,     ^(oj  =  1 

o2  =  xy,     ;8(o8)  =  i 

°a  =  !/2>     ^(o3)  =  1. 

The  theory,  therefore,  does  not  give  us  equations  of  the  linear  form.  This  may  be  produced 
in  several  ways,  most  simply  by  the  common  method  of  presupposing  approximate  values 
of  both  elements,  the  known  a  for  x  and  b  for  y,  and  considering  the  corrections  f  and  y 
to  be  the  elements  of  the  adjustment.  We  therefore  put  x  =  a  -f-  £,  and  y  =  b  -f-  77. 
Rejecting  terms  of  the  2nd  degree,  we  get  the  equations  of  the  observations: 


02  —  ab  =  b£-\-  ay 


where  the  middle  equation  has  still  double  weight.    The  normal  equations  are: 
2a(ot  —  a2)  -t-26(o2—  ab}=  (4a2  -f  2bz)  £  -f 


is  consequently  not  free  of  37,  but  we  find 


«-T 
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For  the  adjusted  value  w8  of  the  middle  observation  we  have 


,       st 

If  we  had  transformed  the  elements  (comp.  §  62)  by  putting 

$  =  «  £  —  6  y 


or 

a:  =  al--     —  bu 


y 
we  should  have  obtained  free  normal  equations 


=  2(a2-f  62)2y. 

If  we  had  placed  absolute  confidence  in  the  adjusting   principle  of  the  sum  of 
squares  as  a  minimum,  a  solution  might  have  been  founded  on 

(o  !  -  «2)2  -f  2  (o,  —  aby  +  (o3  -  ft2)2  =  min. 
The  conditions  of  minimum  are: 
1   d  min 


,   , 
faT  i-  (°t-ab)b  = 

1   drain  ,  ,,  , 

-f~db~  =  (o<i—ab)a  +  (o3  —  b*)b  = 

The  solution  with  respect  to   a  and  b  is  not  very  difficult.    We  see  for  instance 
immediately  that 


or 


a2o3. 


Still  better  is  it  to  introduce  s2  =  a2-f  bz,  by  which  the  equations  become 

(ol  —  sz)a-\-oyb  =  0 
o2rt  +  (o3—  s2)/>  =  0, 
consequently,  ^v 


2      )          \      2.    .. 

If  the  errors  in  0,,  02,  and  03  are  not  large,  olo3  —  o\  must  be  small;  one  of  the 
two  values  of  s2  must  then  be  small,  the  other  nearly  equal  to  o^o^'j  only  the  latter  can 
be  used. 

12 


90 
Further,  we  get: 


'3-°2 


o« — s" 


- 


In  this  way  we  avoid  guessing  at  approximate  values  (for  which  otherwise  we 
should  perhaps  have  taken  a?  =  ol  and  bz  =  o3).  The  values  which  we  have  here  found 
for  a2  and  &2,  and  to  which  may  be  added 

—  ab  =  :— * ^  , 


are  really  exact;  and  if  we  substitute  them  in  the  above  normal  equations,  we  get  £  =  0 
and  tj  =  0. 

Even  when,  as  in  this  case,  the  theory  is  not  linear,  it  is  not  unusual  for  the 
sum  of  the  squares  to  be  a  minimum.  Caution,  however,  is  necessary;  particularly,  it 
may  happen  that  the  sum  of  the  squares  becomes  a  maximum  for  the  found  elements,  or 
for  some  of  them. 

We  may  also  in  another  way  make  the  equations  of  this  example  linear,  namely, 
by  considering  the  logarithms  of  on  o2,  o3  as  the  observed  quantities,  and  finding  the 
logarithms  of  the  elements  from  the  equations  which  will  then  be  linear. 

logOj  =  2  log  a? 
Iogo2  =  log  x  -f  logy 
Iogo3  =  2  logy. 

In  this  way  we  throw  the  difficulty  over  upon  the  squares  of  the  mean  errors.     As 

dz 

log  (z  -4-  dz)  =  log  z  -\     —  , 

z 

we  may  approximately  take 

,*2(log2)    =   ~^^(Z). 

If  a  and  b  also  here  indicate  approximate  values  of  x  and  y,  the  weights  of  the 
3  equations,  respectively,  become  proportional  to  a4,  2a262,  and  b*.  Thus  we  find  the 
normal  equations 

2a4  log  o !  -j-  2a2b2  logo  2  =  (4a4 -f- 2a2i2)  log  a;  -|-  2«2£2logy 

2a262  log  o2  -f  2b4  log  o3  =  2a262  log  x  +  (464  +  2a262)  log  y  , 
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which  give  the  simple  results 
2.og*  -  logo,  - 


,     tag 


This  solution  agrees  only  approximately  with  the  preceding  one.  It  might  seem 
for  a  moment  that,  in  this  way,  we  might  do  without  the  supposition  of  approximate  values 
for  the  elements,  but  this  is  far  from  being  the  case.  For  the  sake  of  the  weights  we 
must,  with  the  same  care,  demand  that  a  and  x,  as  also  b  and  y,  agree,  and  we  must 
repeat  the  adjustment  till  the  squares  of  the  mean  errors  get  the  theoretically  correct 
values.  And  then  it  is  only  a  necessary,  but  not  a  sufficient  condition,  that  x  —  a  and 
y  —  b  are  small.  Unless  the  exactness  of  the  observations  is  also  so  great  that  the  mean 
errors  of  o,  are  small  in  proportion  to  o,  itself,  the  laws  of  errors  of  the  logarithms  cannot 
be  considered  typical  at  the  same  time  as  those  of  the  observations  themselves. 

Example  5.  The  co-ordinates  of  four  points  in  a  circle  are  observed  with  equal 
mean  errors  and  without  bonds  :  #x  =  20,  yl  =  10;  #2  =  16,  yz  =  18;  #„  =  3,  ya  =  17; 
and  x4  =  2,  yt  =  4.  In  the  adjustment  for  the  co-ordinates  a  and  b  of  the  centre  and 
the  radius  r,  we  cannot  use  the  common  form  of  the  equations 

(X-a)*  +  (y-bY  =  r«, 

because  it  embraces  more  than  one  observed  quantity  besides  the  elements.  In  order  to 
obtain  the  separation  of  the  observations  necessary  for  adjustment  by  elements,  we  must 
add  a  supplementary  element,  or  parameter,  F<  for  each  point,  writing  for  instance 

Xi  =  a  -)-  r  cos  F,  ,     */,•  =  b  -j-  -r  sin  F,  . 

As  the  equations  are  not  linear  we  must  work  by  successive  corrections  A  a,  A  b, 
A  r,  A  Vi  of  the  elements,  of  which  the  first  approximate  system  can  be  obtained  by 
ordinary  computation  from  3  points.  For  the  theoretical  corrections  A  xt  and  A  y,  of  the 
co-ordinates  we  get  by  differentiation  of  the  above  equations 

A  #,-  =  A  a  -f-  A  r  •  cos  F,  —  A  F,-  •  r  sin  F, 
A  yt  =  A  b  -f  A  r  -  sin  Fi-  -+-  A  F,-  •  r  cos  F,-  . 

These  equations  for  the  observations  lead  us  to  a  system  of  seven  normal  equations. 
By  the  "method  of  partial  elimination"  (§  61)  these  are  not  difficult  to  solve,  but  here  the 
simplicity  of  the  problem  makes  it  possible  for  us  immediately  to  discover  the  artifice. 
We  know  that  every  transformation  of  equally  well  observed  rectangular  co-ordinates  results 
in  free  functions.  The  radial  and  the  tangential  corrections 

A  xt  cos  Vi  -f-  A  yi  sin  F,-  =  A  «,- 
and 

A  Xi  sin  Vi  —  A  yt  cos  F,  =  A  tt 

12* 
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can,   consequently,   here  be  taken  directly  for  the  mean   values   of  corrections  of  observed 
quantities',  and  as  only  the  four  equations 

A  ti  =  A  a  sin  Vt  —  A  b  cos  F,  —  r  A  F{ 

contain  the  four  corrections  A  F,-  of  the  parameters,  they  can  be  legitimately  reserved  for 
the  successive  corrections  of  the  elements.     In  this  way 

A  Hi  =  A  a  cos  Vi  +  A  b  sin  F,-  +  A  r 

with  equal  mean  errors,   ^2(w)  =  ^2(a?)  =  ^2(#),  are  the  "equations  for  the  observations" 
of  this  adjustment,  and  give  the  three  normal  equations: 

[A  n  cos  F]  =  A  a  [cos2  F]  -f-  A  6  [cos  F  sin  F]  -j-  A  r  [cos  F] 
[A  n  sin  F]  =  A  a  [cos  F  sin  F]  -f  A  J  [sin2  F]  -f-  A  r  [sin  F] 
[A  «]  =  A  a  [cos  F]  -f  A  6  [sin  F]  +  A  r  -  4. 

In  the  special  case  under  consideration,  we  easily  see  that  the  first,  second,  and 
fourth  point  lie  on  the  circle  with  r  =  10,  whose  centre  has  the  co-ordinates  a  =  10  and 
6  =  10;  the  parameters  are  consequently: 

Vl  =  0°0'0,    F2  =  53°7'8,    F3  =  135°  O'O,    and    F4  =  216°52'2. 

For  the  third  point  the  computed  co-ordinates  are:  o;3  =  2-9290  and  «/3  =  17-0710, 
consequently,  A  x3  =  +0-0710  and  A</3  =  —0-0710,  A  t3  =  0,  and  A  «8  =  —0-1005; 
all  other  differences  A  a;,-  =-  0  and  A  yt  =  0.  The  "equations  for  the  observations"  are  : 

1-0000  A  a  +  0-0000  A  b  +  1-0000  A  r  =       0-0000 

0-6000  A  a  +  0-8000  A  b  +  1  -0000  A  r  =       0-0000 

-0-7071  A  a  +  0-7071  A  6  -f  1-0000  A  r  =  —0-1005 

-  0-8000  A  a  —  0-6000  A  6  +  1  -0000  A  r  =       0-0000. 

The  normal  equations  are: 

2-5000  A  a  +  0-4600  A  b  +  0-0929  A  r  =     +  0-0710 
0-4600  A  a-  -f-  1-5000  A  b  -f  0-9071  A  r  =  —  0-0710 
R  =  0-0929  Aa  +  0-907  1A6  +  4-0000  Ar  =  —0-1005. 
By  elimination  of  A  r  we  get 

2-4978  A  a  +  0-4390  A  b  =  +0-0733 

B  =  0-4390  A  a  +  1-2943  A  6    •  =  —  0-0482  ; 

and  by  eliminating  A  b 

A  =  +2-3490  A  a  =  +0-0896. 

From  .ft,  5,  and  A  we  compute 

=  +  0-0381,    A  6  =  —0-0501,    and    Ar  =  —  0-01465. 


For  checking 
+  0-0002 
0-0000 
0-0000 

+  0-0001 
—  0-0001 

o-oooo 


The  checks   are  found  by  substitution  of  these  in  the  several  equations.    The  4  equations 
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for  the  observations  give  the  following  adjusted  values  of  An/: 

A»i  = +0-0234,  A w2  =  —0-0319,  A  wa=— 0-0770,  and  A/t«  —  —  00151: 

the  sum  of  squares  (here  =  (8  —  7)/l2)  is  consequently 

I      *z     J 

=  (0-0234)2  +  (0-0319)2  +  (0-0235)2  +  (0-0151)2  =  0-00235. 
For  this,  by  the  equation  (108),  we  get 

0-01010  —  0-00271  —  0-00356  —  0-00147  =  0-00236 

as  the  final  check  of  the  adjustment. 

The  4  equations  for  A  tt  give  us 

AFt  =  4-17'2,   AK2  —  -f-20'8,   A F8  =  — 2'9,    and    AF4  =   -21'6. 
Thus,  by  addition  of  the  found  corrections  to  the  approximate  values, 

r  =  9-98535,    a  =  10-0381,     b  =  9-9499, 
Vl  ==  0°17'2,    F2  =  53°28'6,    F3  =  134°57'1,    and    F4  ==  216°30'6, 

we  have  the  whole  system  of  elements  for  the  next  approximation,  if  they  are  not  the 
definitive  values.  In  both  cases  we  must  compute  by  them  the  adjusted  values  of  the  co- 
ordinates, according  to  the  exact  formulce;  the  resulting  differences,  obs. — comp.,  are: 

Point         A  a;  Ay  Aw  A  t 

1  -0-0232    +0-0002      -0-0232    +0-0002 

2  +0-0191     +0-0257     +0-0320         0-0000 

3  +0-0166      -0-0166      -0-0234      -0-0001 

4  -0-0123      -0-0090    +0-0152         0-0000. 

The  sum  of  the  squares,  [(Aoj)2  +  (Ay)2]  =  0-00236,  agrees  with  the  above  value, 
which  indicates  that  the  approximation  of  this  first  hypothesis  may  have  been  sufficient. 
Indeed,  the  students  who  will  try  the  next  approximation  by  means  of  our  final  differences, 
will,  in  this  case,  find  only  small  corrections. 

From  the  equations  A,  B,  and  .ft,  which  express  the  free  elements  by  the  original 
bound  elements,  A  a,  A£,  A  r,  we  easily  compute  the  equations  for  the  inverse  trans- 
formation: 

A  a  =       0-4257  •  A 

A  b  =      -0-1444-^  +  0-7726.5 

A  r  =         0-0228  •  A  —  0-1752  -  B  +  0-25  •  R. 

By  these,  any  function  of  the  elements  for  a  given  parameter  can  be  expressed  as  a  linear 
function  of  the  free  functions  A,  B,  and  R;  and  by  ^(A)  =  2-3490 A 2,  ^(B)  =  1-2943 >12, 
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and  >}2(Jft)  =  4^2,  the  mean  error  is  easily  found.    Thus  the  squares  of  the  mean  errors  of 
the  co-ordinates  x  and  11  are 

*/ 

(x)  =  {2-3490  (    0-4257  +  0-0228  cos  F)2  -f 1  -2943  (-0-1752  cos  F)2  -f  4  (0-25  cos  F)2}/1. 

=  {2-3490(— 0-1444  +  0-0228  sin  F)2  + 1-2943  (    0-7726  —  0-1752  sin  F)2  -f  4  (0-25  sin  F)2}>1, 

Only  the  value   ^2  =  0-00236,    found   by  the   summary    criticism,    is    here    very 
uncertain. 


XIII.     SPECIAL  AUXILIARY  METHODS. 

§  57.  We  have  often  occasion  to  use  the  method  of  least  squares,  particularly 
adjustment  by  elements;  and  this  sometimes  requires  so  much  work  that  we  must  try  to 
shorten  it  as  much  as  possible,  even  by  means  which  are  not  quite  lawful.  Several  temp- 
tations lie  near  enough  to  tempt  the  many  who  are  soon  tired  by  a  somewhat  lengthened 
computation,  but  not  so  much  by  looking  for  subtleties  and  short  cuts.  And  as,  moreover, 
the  method  was  formerly  considered  the  best  solution  —  among  other  more  or  less  good  - 
not  the  only  one  that  was  justified  under  the  given  supposition,  it  is  no  wonder  that  it 
has  come  to  be  used  in  many  modifications  which  must  be  regarded  as  unsafe  or  wrong. 
After  what  we  have  seen  of  the  difference  between  free  and  bound  functions,  it  will  be 
understood  that  the  consequences  of  transgressions  against  the  method  of  least  squares 
stand  out  much  more  clearly  in  the  mean  errors  of  the  results  than  in  their  adjusted 
values.  And  as  —  to  some  extent  justly  —  more  importance  is  attached  to  getting  tolerably 
correct  values  computed  for  the  elements,  than  to  getting  a  correct  idea  of  the  uncertainty, 
the  lax  -morals  with  respect  to  adjustments  have  taken  the  form  of  an  assertion  to  the 
eifect  that  we  can,  within  this  domain,  do  almost  as  we  like,  without  any  great  harm, 
especially  if  we  take  care  that  a  sum  of  squares,  either  the  correct  one  or  another,  becomes  a 
minimum.  This,  of  course,  is  wrong.  In  a  text-book  we  should  do  more  harm  than  good 
by  stating  all  the  artifices  which  even  experienced  computers  have  allowed  themselves  to 
employ,  under  special  circumstances  and  in  face  of  particularly  great  difficulties.  Only 
a  few  auxiliary  methods  will  be  mentioned  here,  which  are  either  quite  correct  or  nearly 
so,  when  simple  caution  is  observed. 

§  58.  When  methodic  adjustment  was  first  employed,  large  numbers  of  figures 
were  used  in  the  computations  (logarithms  with  7  decimal  places),  and  people  often  com- 
plained of  the  great  labour  this  caused;  but  it  was  regarded  as  an  unavoidable  evil,  when 
the  elements  were  to  be  determined  with  tolerable  exactness.  We  can  very  often  manage, 
however,  to  get  on  by  means  of  a  much  simpler  apparatus,  if  we  do  not  seek  something 
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which  cannot  bo  determined.  During  the  adjustment  properly  so  called,  we  ought  to  be 
able  to  work  with  three  figures.  But  this  ideal  presupposes  that  two  conditions  are  satis- 
tied:  the  elements  we  seek  must  be  small  and  free  of  one  another,  or  nearly  so;  and  in 
both  respects  it  can  be  difficult  enough  to  protect  oneself  in  time  by  appropriate  trans- 
formation. Often  it  is  only  through  the  adjustment  itself  that  we  learn  to  know  the 
artifices  which  would  have  made  the  work  easy.  This  applies  particularly  to  the  mutual 
freedom  of  the  elements.  The  condition  of  their  smallness  is  satisfied,  if  we  everywhere  use 
the  same  preparatory  computation  as  is  necessary  when  the  theory  is  not  of  linear  form. 

By  such  means  as  are  used  in  the  exact  mathematics,  or  by  a  provisional,  more 
or  less  allowable  adjustment,  we  get,  corresponding  to  the  several  observations  o,  .  ..<)„, 
a  set  of  values  vl  . . .  vn  which  are  computed  by  means  of  the  values  x0  . . .  z0  of  the 
several  elements  a? . . .  z,  and  which,  while  they  satisfy  all  the  conditions  of  the  theory  with 
perfect  or  at  any  rate  considerable  exactness,  nowhere  show  any  great  deviation  from  the 
corresponding  observed  value.  It  is  then  these  deviations  o,—  vt  and  x  —  x0  . ..  which  are 
made  the  object  of  the  adjustment,  instead  of  the  observations  and  elements  themselves 
with  which,  we  know,  they  have  mean  error  in  common.  When  in  a  non-linear  theory 
the  equations  between  the  adjusted  observation  and  the  elements  are  of  the  general  form 

Ui  =  F  (x, . . .  z) , 
they  are  changed  into 

-  —  -  ©  o<*  -  *o)  +  •  •  -  +  (f )  o(»  -  «.)  (109, 

by  means  of  the  terms  of  the  first  degree  in  Taylor's  series,  or  by  some  other  method  of 
approximation.  If  the  equations  are  linear 

Ui  =  ptx-\-  . . .  -f-r<js, 
we  have,  without  any  change,  for  the  deviations: 

Ui-Vi  =  pt(X  —  x0)-\-  ...  +  rt(z  —  z0).  (110) 

No  special  luck  is  necessary  to  find  sets  of  values,  #,-,  . . .  x0,  . . .  20,  whose  devia- 
tions Oi  —  Vi  show  only  two  significant  figures;  and  then  computation  by  3  figures  is,  as 
far  as  that  goes,  sufficient  for  the  needs  of  the  adjustment. 

The  method  certainly  requires  a  considerable  extra-work  in  the  preparatory  com- 
putation, and  it  must  not  be  overlooked  that  computations  with  an  exactness  of  many 
decimal  places  will  often  be  necessary  in  this  part;  especially  #,  ought  to  be  computed  with 
the  utmost  care  as  a  function  of  XQ  . . .  ZQ  ,  lest  any  uncertainty  in  this  computation  should 
increase  the  mean  errors,  so  that  we  dare  not  put  /I2(o— v)  =  A2(o). 

This  additional  work,  however,  is  not  quite  wasted,  even  when  the  theory  is  linear. 
The  list  of  the  deviations  o,  —  v,  will,  by  easy  estimates,  graphic  construction,  or  directly 
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by  the  eye,  with  tolerable  certainty  lead  to  the  discovery  of  gross  errors  in  the  series  of 
observations,  slips  of  the  pen,  etc.,  which  must  not  be  allowed  to  get  into  the  adjust- 
ment. The  preliminary  rejection  of  such  observations  may  save  a  whole  adjustment;  the 
ultimate  rejection,  however,  falls  under  the  criticism  after  the  adjustment. 

In  computing  the  adjusted  values,  particularly  uf,  after  the  solution  of  the  normal 
equations,  we  ought  not  to  rely  too  confidently  on  the  transformation  of  the  equations  into 
linear  form  or  into  equations  of  deviations  for  Oi  —  Vi.  Where  it  is  possible,  the  actual 
equations  U{  —  F(x,  . . .  z)  ought  to  be  employed,  and  with  the  same  degree  of  accuracy 
as  in  the  computation  of  «?,-.  In  this  way  only  can  we  see  whether  the  approximate  system 
of  elements  and  values  has  been  so  near  to  the  final  result  as  to  justify  the  rejection  of 
the  higher  terms  in  Taylor's  series.  If  not,  the  adjustment  may  only  be  regarded  as 
provisional,  and  must  be  repeated  until  the  values  of  w,-,  got  by  direct  computation, 
agree  with  the  values  through  Ui  —  Vi  in  the  linear  equations  of  adjustment. 

On  the  whole  the  adjustment  ought  to  be  repeated  frequently  till  we  get  a  sufficient 
approximation.  This,  for  instance,  is  the  rule  where  the  observations  represent  probabilities, 
for  which  Xz(ot]  ^  generally  known  only  as  functions  of  the  unknown  quantities  which 
the  adjustment  itself  is  to  give  us. 

§  59.  The  form  of  the  theory,  and  in  particular  the  selection  of  its  system  of 
elements,  is  as  a  rule  determined  by  purely  mathematical  considerations  as  to  the 
elegance  of  the  formulae,  and  only  exceptionally  by  that  freedom  between  the  elements 
which  is  wanted  for  the  adjustment.  On  the  other  hand  it  will  generally  be  impossible 
to  arrange  the  adjustment  in  such  a  way  that  the  free  elements  with  which  it  ends,  can 
all  be  of  direct,  theoretical  interest.  A  middle  course,  however,  is  always  desirable,  for  the 
reasons  mentioned  in  the  foregoing  paragraph,  and  very  frequently  it  is  also  possible,  if 
only  the  theory  pays  so  much  respect  to  the  adjustments  that  it  avoids  setting  up,  in  the 
same  system,  elements  between  which  we  may  expect  beforehand  that  strong  bonds  will 
exist.  Thus,  in  systems  of  elements  of  the  orbits  of  planets,  the  length  of  the  nodes  and 
the  distance  of  the  perihelion  from  the  node  ought  not  both  to  be  introduced  as  elements; 
for  a  positive  change  in  the  former  will,  in  consequence  of  the  frequent,  small  angles  of 
inclination,  nearly  always  entail  an  almost  equally  large  negative  change  in  the  latter.  If 
a  theory  says  that  the  observation  is  a  linear  function  of  a  single  parameter,  t,  the  formula 
ought  not  to  be  written  u=p-\-qt,  unless  all  the  fs  are  small,  some  positive,  and  others 
negative,  but  u  =  rjrq(t  —  t0),  where  tQ  is  an  average  of  the  parameters  corresponding  to 
the  observations.  If  we  succeed,  in  this  way,  in  avoiding  all  strongly  operating  bonds, 
and  this  can  be  known  by  the  coefficients  of  all  the  normal  equations  outside  the  diagonal 
line  becoming  numerically  small  in  comparison  with  the  mean  proportional  between  the 
two  corresponding  coefficients  in  the  diagonal  line,  then  we  have  at  any  rate  attained  so 
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much  that  we  need  not  use  in  the  calculations  for  the  adjustment  many  more  decimal 
places  than  about  the  3,  which  will  always  be  sufficient  when  the  elements  are  originally 
mutually  free,  and  not  during  the  adjustment  are  first  to  be  transformed  into  freedom 
with  painful  accuracy  in  the  transformation  operations. 

If,  by  careful  selection  of  the  elements,  we  even  get  so  far  that  no  sum  of  the 
products  [pq]1)  in  numerical  value  exceeds  about  ^  of  the  mean  proportional  between  the 
corresponding  sums  of  squares  V^[#p]  [qq] ,  or  in  many  cases  only  ^  of  these  amounts, 
then  we  may  consider  the  bonds  between  the  elements  insignificant.  The  normal  equations 
themselves  may  then  be  used  to  determine  the  law  of  error  for  the  elements;  we  compute 
provisionally  a  first  approximation  by  putting  all  the  small  sums  of  products  =  0,  and  in 
the  second  approximation  we  correct  the  [j9o]'s  by  substituting  the  sums  of  the  products 
and  the  values  of  the  elements  as  found  in  the  first  approximation.  For  instance: 

[po]  —  [pq]  yi-  -  [pr]  z,   -  =  [pp]  xz  (111) 

while 

*i(*t)  =  r^!tei+  [/4V     +W1  =          <112> 

[pp]  \  [qq]  [rr]  ) 


[pq]*  [pr] 


•)• 


(113) 


[qq]  M 

As  the  errors  in  these  determinations  are  of  the  second  order,  it  will  not,  if  the  o's 
themselves  are  small  deviations  from  a  provisional  computation,  be  necessary  to  make  any 
further  approximations. 

Even  if  the  bonds  between  the  elements,  which  are  stated  in  terms  of  the  sums 
of  the  products,  are  stronger,  we  can  sometimes  get  them  untied  without  any  transforma- 
tion. If  we  can  get  new  observations,  which  are  just  such  functions  of  the  elements  that 
the  sums  of  the  products  will  vanish  if  they  are  also  taken  into  consideration,  we  will  of 
course  put  off  the  adjustment  until,  by  introducing  them  into  it,  we  cannot  only  facilitate 
the  computation,  but  also  increase  the  theoretical  value  and  clearness  of  the  result.  And 
if  we  can  attain  freedom  of  the  elements  by  rejecting  from  a  long  series  of  observations 
some  single  ones,  we  do  not  hesitate  to  use  this  means;  especially  as  such  unused  observa- 
tions may  very  well  be  employed  in  the  criticism.  If,  for  instance,  an  arctic  expedition 
has  made  meteorological  observations  at  some  fixed  station  for  a  little  more  than  a  com- 
plete year,  we  shall  not  hesitate  in  the  adjustment,  by  means  of  periodical  functions,  to 
leave  out  the  overlapping  observations,  or  to  make  use  of  the  means  of  the  double  values, 
giving  them  the  weight  of  single  observations. 


')  In  what  follows  we  write,  for  the  sake  of  brevity,  [p<?]  for  py-J. 

13 
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§  60.  Though  of  course  the  fabrication  of  observations  is,  in  general,  the  greatest 
sin  which  an  applied  science  can  commit,  there  exists,  nevertheless,  a  rather  numerous  and 
important  class  of  cases,  in  which  we  both  can  and  ought  to  use  a  method  which  just 
depends  on  the  fabrication  of  such  observations  as  might  bring  about  the  freedom  of  the 
theoretical  elements.  As  a  warning,  however,  against  misuse  I  give  it  a  harsh  name:  the 
method  of  fabricated  observations. 

If,  for  instance,  we  consider  the  problem  which  has  served  us  as  an  example  in  the 
adjustment,  both  by  correlates  and  by  elements,  viz.  the  determination  of  the  abscissae  for 
4  points  whose  6  mutual  distances  have  been  measured  by  equally  good,  bondfree  observa- 
tions, we  can  scarcely  after  the  now  given  indications  look  at  the  normal  equations, 

0  j  g      |"   ^  i  3  ~\~  0  \  4      =  OX  j  1^2  *• ""  3  3?  4 
12      I         23      I         24     =                    "^  1      I     Ot/?  2          -I  "^  3           J-  **-'  4 

—  Oj  3          02  3  -j-  034     =  IX  j          la?  2     |     "«*-3          1^4 

0 1  4  24  ^34     =   =  "^  1  3? 2  •*•  3J  3  "|     ""^41 

without  immediately  feeling  the  want  of  a  further  observation: 

0  =  lxl  +  lxz  -\-  Ix3  -(-  Icr4  , 

which,  if  we  imagine  it  to  have  the  same  weight  =  1  as  each  of  the  measurements  of 
distance  ^(0™)  =  xr  —  xs,  will  give  by  addition  to  the  others,  but  without  specifying  the 
value  of  0, 

^  +  °i2  +  °i3+°i4   =  4a?! 

°  —  °12  +  °23   +°24     =    4iK2 

0  —  0,3  —  o23  +o34  =  4a?3 
0  — o14  — o24  — o34  ==  4a;4, 

and  consequently  determine  all  4  abscissae  as  mutually  free  and  with  fourfold  weight. 

What  in  this  and  other  cases  entitles  us  to  fabricate  observations  is  indeter- 
minateness  in  the  original  problem  of  adjustment  --  here,  the  impossibility  of  determining 
any  of  the  abscissae  by  means  of  the  distances  between  the  points.  When  we  treat 
such  problems  in  exact  mathematics  we  get  simpler,  more  symmetrical,  and  easier  solu- 
tions by  introducing  values  which  can  only  be  determined  arbitrarily;  and  so  it  is  also  in 
the  theory  of  observation.  But  the  arbitrariness  gets  here  a  greater  extent,  because  not 
only  mean  values,  but  also  mean  errors  must  be  introduced  for  greater  convenience.  And 
while  we  can  always  make  use  of  a  fabricated  observation  in  indeterminate  problems  for 
the  complete  or  partial  liberation  of  the  elements,  we  must  here  carefully  demonstrate, 
by  criticism  in  each  case,  that  the  fabrication  we  have  used  has  not  changed  anything 
which  was  really  determined  without  it. 
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In  the  above  example,  this  is  seen  in  the  first  place  by  0  disappearing  from  all 
the  adjusted  values  for  the  distances  orr  — a-,,  and  then  by  O's  own  adjusted  value, 
determined  as  the  sum  xl-\-xt~\-sca-\-x4,  and  leading  only  to  the  identity  O«=0.  The 
adjustment  will  consequently  neither  determine  0  nor  let  it  get  any  influence  on  the 
nther  determinations.  The  mean  errors  show  the  same  and,  moreover,  in  such  a  way  that 
the  criterion  becomes  independent  of  whether  0  has  been  brought  into  the  computation 
as  an  indeterminate  number  or  with  an  arbitrary  value,  for,  after  the  adjustment  as  well 
as  before,  we  have  for  0t  /12(0)  «=  1.  The  scale  for  0  is  consequently  «=  0,  and  this  is 
also  generally  a  sufficient  proof  of  our  right  to  use  the  method  of  fabricated  observations. 

§  61.  The  method  of  partial  eliminations.  When  the  number  of  elements  is 
large,  it  becomes  a  very  considerable  task  to  transform  the  normal  equations  and  eliminate 
the  elements.  The  difficulty  is  nearly  proportional  to  the  square  of  that  number.  Long 
before  the  elements  would  become  so  numerous  that  adjustment  by  correlates  could  be 
indicated,  a  correct  adjustment  by  elements  can  become  practically  impossible.  The  special 
criticism  is  quite  out  of  the  question,  the  summary  criticism  can  scarcely  be  suggested,  and 
the  very  elimination  must  be  made  easier  at  any  price.  If  it  then  happens  that  some  of 
the  elements  enter  into  the  expressions  for  some  of  the  observations  only,  and  not  at  all  in 
the  others,  then  there  can  be  no  doubt  that  the  expedient  which  ought  first  to  be  employed 
is  the  partial  elimination  (before  we  form  the  normal  equations)  of  such  elements  from  the 
observations  concerning  them.  These  observations  will  by  this  means  be  replaced  by  certain 
functions  of  two  observations  or  more,  which  will  generally  be  bound;  and  they  will  be 
so  in  a  higher  and  more  dangerous  degree  the  fewer  elements  we  have  eliminated.  By 
this  proceeding  we  may,  consequently,  imperil  the  whole  ensuing  adjustment,  the  foundation 
of  which,  we  know,  is  unbound  or  free  observations  as  functions  of  its  elements. 

If  now  it  must  be  granted  that  the  difficulties  can  become  so  great  that  we  cannot 
insist  on  an  absolute  prohibition  against  illegitimate  elimination,  we  must  on  the  other 
hand  emphatically  warn  against  every  elimination  which  is  not  performed  through  free 
functions,  and  much  the  more  so,  as  it  is  quite  possible,  in  a  great  many  cases  in  which 
abuses  have  taken  place,  to  remain  within  the  strictly  legitimate  limits  of  the  free  functions, 
by  the  use  of  "the  method  of  partial  eliminations". 

This  is  connected  with  the  cases,  in  which  some  of  the  observations,  for  instance 
ol  . . .  om,  according  to  the  theory,  depend  on  certain  elements,  for  instance  x, . . .  y,  which 
do  not  occur  in  the  theoretical  expression  for  any  other  of  the  observations.  Our  object  is 
then,  by  the  formation  of  the  normal  equations  to  separate  o,  . . .  om  as  a  special  series  of 
observations.  We  begin  by  forming  the  partial  normal  equations  for  this,  and  then  imme- 
diately perform  the  elimination  of  :r,  . . .  y  from  them,  without  taking  into  consideration 
whether  these  equations  alone  would  be  sufficient  for  a  determination  of  the  other  elements. 

13* 
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As  soon  as  x  . . .  y  are  eliminated,  the  process  of  elimination  is  suspended.  The  trans- 
formed equations  containing  these  elements  (which  now  represent  functions  that  are  free  of 
all  observations,  and  functions  which  depend  only  on  the  remaining  elements  2,  . . .  M),  are 
put  aside  till  we  come  back  to  the  determination  of  x  ...  y.  The  other  partially  transformed 
normal  equations,  originating  in  the  group  o^  . . .  om,  are  on  the  other  hand  to  be  added, 
term  by  term,  to  the  normal  equations  for  the  elements  2, . . .  M,  formed  out  of  the  remain- 
ing observations,  before  the  process  of  elimination  is  continued  for  these  elements. 

That  this  proceeding  is  quite  legitimate  becomes  evident  if  we  imagine  the 
elements  x  . . .  y  transformed  into  the  elements  x' . .  .  y',  which  are  free  of  z  ...  u,  and  then 
imagine  x' . . .  y'  inserted  instead  of  x  . . .  y  in  the  original  equations  for  the  observations. 
For  then  all  the  sums  of  products  with  the  coefficients  of  x' . . .  y'  will  identically  become 
=  0,  and  the  sums  of  squares  and  sums  of  products  for  the  separated  part  of  the  observa- 
tions will,  as  addenda  in  the  coefficients  of  the  normal  equations  (compare  (57)),  come  out, 
immediately,  with  the  same  values  as  now  the  transformed  normal  equations. 

As  an  example  we  may  treat  the  following  series  of  measurements  of  the  position 
of  3  points  on  a  straight  line.  The  mode  of  observation  is  as  follows.  We  apply  a  millimeter 
scale  several  times  along  the  straight  line,  and  then  each  time  read  off  by  inspection  with 
the  unaided  eye  either  the  places  of  all  the  points  against  the  scale  or  the  places  of  two 
of  them.  The  readings  for  each  point  are  found  in  its  separate  column,  and  those  on  the 
same  row  belong  to  the  same  position  of  the  scale.  (Considered  as  absolute  abscissa- 
observations  such  observations  are  bound  by  the  position  of  the  zero  by  every  laying 
down  of  the  scale;  but  these  bonds  are  evidently  loosened  by  our  taking  up  the  position 
against  the  scale  of  an  arbitrarily  selected  fixed  origin  yr  as  an  element  beside  the  abscissae 
#!,  #2,  #3  of  the  three  points).  All  mean  errors  are  supposed  to  be  equal. 


Position 
of 
the  Scale 

1 

Point 

Eliminated  free  Elements 

n.99   _  -    //        1     i  (r      \    r  \ 
—  —    —   -    y  \     '  \     y  \^  i     [     *^  2  / 

I 

mm. 

6-9 

II 

mm. 

27-54 

III 

mm. 

2 

8-35 

54-95 

31-65  =  ;«/2  -H-Vi  +  ^s) 

3 

7-9 

54-5 

31-20  ==  y.A   -f-Ka^+aJa) 

4 

21-16 

47-2 

34-18  -1  y,  +i  (*,  +  *„) 

5 

10-74 

36-7 

23-72  ==  yb  +M^2  +  *3) 

6 

4-06 

30-1 

17-08  =  y,  -f  Ito  +  aj.) 

7 

31-45 

51-98 

78-06 

COOO                                   !1/~.          I 

Oo-oo  =  y-j  -\-  -j(#i  -f-  %%- 

8 

32-9 

53-5 

79-5 

?Sfv30  -  -  11       1    *  (r    -\-  r    - 

IJtJ    tJ\J      —    —       M  o                    ~T  1  *v  |         [        t*'  2 

9 

9-6 

30-3 

56-22 

32-04  =  2/9  +  !(#!  +  #2- 

10 

20-16 

40-78 

66-8 

4^'0o  =  y  \  o~i  TT  v^i  ~\  *^2  ~ 

11 

18-9 

39-5 

65-56 

41.32  =  yn-f  1(0?!  +  ^- 

2. 

OQ 
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As  the  theoretical  equation  for  the  *th  observation  in  the  «th  column  lias  the  form 


and  every  observation,  therefore,  is  a  function  of  only  two  elements,  there  is  every  reason 
to  use  the  method  of  partial  elimination.  If  we  choose  first  to  eliminate  the  y's,  we  have 
consequently  to  form  normal  equations  for  each  of  the  11  rows.  Where  only  two  points 
are  observed  these  normal  equations  get  the  form 

or-\~o,  =  2y,  -f-  xr  -f  xt 

Or   =          i         Xr 


for  three  points  the  form  of  the  normal  equations  is 

°i  +  02  +  °3  =  =  %«  +  xl  +  x.£  -f  x 


Of  these  equations  those  referring  to  the  «/,-  have  given  the  eliminated  free  elements 
stated  above  to  the  right  of  the  observations  after  the  perpendicular. 

By  subtracting  these  equations  from  the  corresponding  other  equations  we  get, 
in  the  cases  where  there  are  2  points: 

Or  —  £(°r-f0«)    =  kXr—  kX* 

0,  —  |(0r-f0,)    ^    —  l^r  +  l^, 

and  in  cases  where  there  are  3  points  : 


By  forming  the  sum  of  these  differences  for  each  column,  and  counting,  on  the 
right  side  of  the  equations,  how  often  each  element  occurs  with  one  other  or  with  two 
others,  we  consequently  get  the  ultimate  normal  equations: 

-168-98  =         f^-f^-^, 

3<'71    =      -y^1+Ja;2  —  ~6X3 

+  206-69=    -^  _*•*,  +  »*,. 

The  case  is  here  simple  enough  to  be  solved  by  a  fabricated  observation.  How  is 
its  most  advantageous  form  found,  when  its  existence  is  given? 


ADSWer:  =         +      +       '    "eight  =  23712, 
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after  which  we  get  the  normal  equations: 

£!.o -168-98  =•  JS*i 

o  t '  l  L    —     «„   X  a 


759 


^  o  +  206-69 


X.. 


consequently, 

Xl  =  o  —  25™38  ,    a?a  =  o  —  4-77 
From  these  we  now  compute  the  */'s: 

*W  i  OM   £istj  ^   l  v  7 

?/2  =  33-72   --o,         7/8 

?/3  =  33-27     -  o  ,         y9 

y4  =  25-945  —  o  , 

y5  =  15-485  —  0, 

ya  =    8-845  —  0, 

We  need  not  here  state  the  adjusted  values  for  the  several  observations,  nor  their 
differences,  of  which  it  is  enough  to  say  that  their  sum  vanishes  both  for  each  row  and 
for  each  column;  their  squares,  on  the  other  hand,  will  be  found  to  be: 


and   xs  =  o  -f  21-24. 


y7      =  56-80  -  o  , 

ys    =  58-27  —  0, 

ya    =  35-01  —  o  , 

y10  ==  45-55  —  o, 

yll  =  44-29  —  0. 


I 

II 

III 

Total: 

•0002 

•0002 

•0004 

1 

•0001 

2 

1 

1 

2 

2 

2 

4 

6 

6 

12 

2 

2 

4 

9 

25 

4 

38 

1 

0 

1 

2 

9 

36 

9 

54 

1 

0 

1 

2 

1 

4 

9 

14 

Total:     -0025 

•0077 

•0036 

•0138 

2'  =  -0028 


2'  —  -0110 


For  the  summary  criticism  we  notice  that  the  number  of  observations  is  27,  the 
number  of  the  elements  is  3  +  11  — 1  =  13,  divisor  consequently  =  14  (one  element  being 
wholly  engaged  by  the  fabricated  observation  o).  The  unit  of  the  mean  error  is  therefore 
determined  by  Ez  ==  0-0010,  and  the  mean  error  on  single  reading  i  0-032,  which  agrees 
well  with  what  we  may  expect  to  attain  by  practice  in  estimates  of  tenth  parts. 
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As  to  special  criticism  it  is  here,  where  the  weights  of  the  eliminated  free 
functions  are  respectively  2  and  3  times  the  weight  of  the  single  observation,  while  the 
weights  of  #, ,  #2,  and  xa  after  the  adjustment  become  respectively  ^-,  ~,  and  7^,  very 
c;is\  to  compute  the  scales 


1  — 


1  — 


/1 2(o)  Weight  after  the  adjustment  * 

With  759  as   common   denominator  we  find  for  the  several  scales  and  the  sums  of  their 

most  natural  groups: 

/ 

I  II  III 


=-  399G 


1 

327 

327 

654 

2 

331-5 

331-5 

663 

3 

331-5 

331-5 

663 

4 

336 

336 

672 

5 

336 

336 

672 

6 

336 

336 

672 

7 

436 

442 

448 

1326 

8 

436 

442 

448 

1326 

9 

436 

442 

448 

1326 

10 

436 

442 

448 

1326 

11 

436 

442 

448 

1326 

3170 

3545 

3911 

10626 

2  =  6630 


The  comparison  with  the  sums  of  squares  in  the  groups,  divided  by  Ez,  shows  then  for 
point  1  2-5  instead  of  ^~  =  4-2il/£H,  for  point  II  7-7  instead  of  4-7=Jrl/9T4~,  for 
point  III  3-6  instead  of  5-1^1/10-2,  for  all  positions  of  the  scale  with  two  readings 


2-8  instead  of  5-3^1/10-6  ,  and  for  positions  with  3  readings  11-0  instead  of  8-7 
The  limit  of  the  mean  error  is  consequently  reached  only  in  the  group  of  point  II,  where 
(7-7  —  4-7)2  =  9-0  <  9-4,  and  it  is  nowhere  exceeded.  We  have  a  check  by  summing 
the  scales: 

»  14  =  27-11-3+1. 


§  62.  In  such  cases  in  which  the  circumstances  and  weights  of  the  observations 
are  distributed  in  some  regular  way,  this  will  often  facilitate  the  treatment  of  the  normal 
equations.  The  elimination  of  the  elements  and  the  transformation  of  the  normal  equations 
into  such  whose  left  hand  sides  can  be  regarded  as  unbound  observations,  as  they  are  free 
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functions  of  the  original  observations,  need  not  always  be  so  firmly  connected  with  one  another 
as  in  the  ordinary  method.  If  we,  in  a  suitable  way,  take  advantage  of  regularity  in  the  obser- 
vations, and  thereby  'are  able,  to  find  a  transformation  which  sets  the  normal  equations  free, 
then  the  determination  of  the  several  elements  will  scarcely  throw  any  material  obstacles 
in  our  way.  But  in  order  to  find  out  any  special  transformations,  we  must  know  the 
general  form  of  the  changes  of  the  normal  equations  resulting  from  transformation  of  the 
original  elements  into  such  as  are  any  homogeneous  linear  funtions  of  them  whatever. 

If  the  equations   for  the   unbound  observations  in  terms  of  the  original  elements 
have  been 

o.  =  p.x  +  q.y  +  r.z, 

the  normal  equations  will  be: 

[po]  = 
[qo]  =  = 
[ro\  =  [rp]x-\-[rq]y  -\-[rr]z. 

And  if  we  wish  to  substitute  new  elements,  £,  37,  and  £»  for  the  old  ones,  we  make  use  of 
substitutions  in  which  the  original  elements  are  represented  as  functions  of  the  new  ones, 

therefore 

x  =  h^  +  ktf  +  l^      I 

y  =  M  +  M  +  ^C  (H4) 

z  ==  AsS  +  M  +  ^C-     J 

The  equations  for  the  observations  then  have  the  form 

Oi  =  (pihl  +2,-A2  +  r<&8)£  -f  (Pikl  +2.^2  +»*ffca)?  +  (Pili  +2^2  +  M3)C-    (115) 


The  new  normal  equations  may  be  formed  from  these,  but  the  form  becomes  very  cumbrous, 
the  equation  which  specially  refers  to   £  being 


o]  =  = 

^2  +  ^3)  (jPJi  +  ^g  +  rla)] 


The  computation  ought  not  to  be  performed  according  to  the  expressions  for  the  coefficients 
which  come  out  when  we  get  rid  of  the  round  brackets  under  the  signs  of  summation  [  ]. 
But  it  is  easy  to  give  the  rule  of  the  computation  with  full  clearness.  The  old  normal 
equations  are  first  treated  exactly  as  if  they  were  equations  for  unbound  observations,  for 
x,  y,  and  z,  respectively;  expressed  by  the  new  elements,  consequently  by  multiplication, 
by  columns,  by  A,,  A2,  and  h3  and  addition;  by  multiplication  by  &,,  &2,  and  ks  and 
addition;  and  by  multiplication  by  ^,  12,  and  ls  and  succeeding  addition.  Thereby,  certainly, 
we  get  the  new  normal  equations,  but  still  with  preservation  of  the  old  elements: 
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The  second  part  of  the  operation  must  therefore  consist  in  the  substitution  of  the 
new  elements  for  the  original  ones  in  the  right  hand  sides  of  these  equations.  In  order 
to  find  the  coefficients  of  £,  351,  and  £,  we  must  therefore  here  again  multiply  the  sums  of 
the  products,  now  by  rows,  by 

"  \  1    "  2  '        3 

If        If        If 
n/j,   rt/gf   Ks 

M'    *ti    ^a 

and  add  them  up. 

Example.  It  happens  pretty  often,  for  instance  in  investigations  of  scales  for 
linear  measures,  that  there  is  symmetry  between  the  elements,  two  and  two,  xr  and  xm-r, 
so  that  for  instance  the  normal  equation  which  specially  refers  to  av,  has  the  same  coeffi- 
cients, only  in  inverted  order,  as  the  normal  equation  corresponding  to  xm_r\  of  course, 
irrespective  of  the  two  observed  terms  [po]  on  the  left  hand  sides  of  the  equations. 
Already  P.  A.  Hansen  pointed  out  that  this  indicates  a  transformation  of  the  elements 
into  the  mean  values  sr  =  \(xr-\-xm-r}  and  their  half  differences  dr  =  \(xr  —  #m_r).  In 
this  case  therefore  the  equations  for  the  old  elements  by  the  new  ones  have  the  form 

xr      =  sr-\-  dr 

Xtn  —  r  ~~    $r         (*r  i 

and  the  transformation  of  the  normal  equations  is,  consequently,  performed  just  by  forming 
sums  and  differences  of  the  original  coefficients.     If  the  normal  equations  are 

[ao]  =  4x  -f  3y  -f-  2z  +  IM 

[bo]  =  3a;  +  6y  +  4s  +  2w 

[co]  ==  2x  -f  4y  +  6z  -f  3w 

[do]  =  \x  -f  2y  +  3^  -f  4w  , 
the  procedure  is  as  follows: 


[ao]-{-[do]  ==  5 

[bo]  +  [co]  =-5 

[ao]-[do]  ==  3*+    \y-     \z  -  3u  - 

6o_co  =  laj+   2—  2z-lu  - 


A 

14 
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As  in  this   example,   we  always    succeed    in    separating   the   mean  values    from   the  halt 
differences,  as  two  mutually  free  systems  of  functions  of  the  observations. 

§  63.  The  great  simplification  that  results  when  the  observations  are  mere  repe- 
titions, in  contradistinction  to  the  general  case  when  there  are  varying  circumstances  in 
the  observations,  is  owing  to  the  fact  that  the  whole  adjustment  is  then  reduced  to  the 
determination  of  the  mean  values  and  the  mean  errors  of  the  observations.  Before  an  adjust- 
ment, therefore,  we  not  only  take  the  means  of  any  observations,  which  are  strictly  speaking 
repetitions,  but  we  also  save  a  good  deal  of  work  in  the  cases  which  only  approximate  to 
repetitions,  viz.  those  where  the  variations  of  circumstances  have  been  small  enough  to  allow 
us  to  neglect  their  products  and  squares.  It  has  not  been  necessary  to  await  the  systematic 
development  of  the  theory  of  observations  to  know  how  to  act  in  such  cases. 

When  astronomers  have  observed  the  place  of  a  planet  or  a  comet  several  times 
in  the  same  night,  they  form  a  mean  time  of  observation  £,  a  mean  right  ascension  a, 
and  a  mean  declination  S,  and  consider  a  and  S  the  spherical  co-ordinates  of  the  star  at 
the  time  t. 

With  the  obvious  extensions  this  is  what  is  called  the  normal  place  method,  the 
most  important  device  in  practical  adjustment.  Such  observations  whose  essential  circum- 
stances have  "small"  variations,  are,  before  the  adjustment,  brought  into  a  normal  place,  by 
forming  mean  values  both  for  the  observed  values  themselves  and  for  each  of  their  essential 
circumstances,  and  on  the  supposition  that  the  law  which  connects  the  observations  and 
circumstances,  holds  good  also,  without  any  change,  with  respect  to  their  mean  values. 

Much  trouble  may  be  spared  by  employing  the  normal  place  method:  The  question 
is,  whether  we  lose  thereby  in  exactness,  and  then  how  much. 

We  shall  first  consider  the  case  where  the  unbound  observations  o  are  linear 
functions  of  the  varying  essential  circumstances  a?,  . . . .  z,  the  equation  for  the  observa- 
tions being: 

A ,  (o)  =  a  -f  bx  -f  •  •  •  •  +  dz. 

With  the  weights  v  we  form  the  normal  equations: 

[vo  ]  =  =  a[v]+b[vx]+  ...  -\-d\vz\  (117) 

\vxo\  —  a  \yx\  -f-  b  [vx2]  -f-  .  .  .  -(-  d  \vxz]    \ 

(118) 
[vzo]  =  a\yz\  -f-  &[#£#]  -j-  .  .  .  -\-  d[vz2].  ) 

If  the  whole,  series  of  observations  is  gathered  into  a  single  normal  place,  0, 
corresponding  to  the  circumstances  X  . .  .  Z,  and  with  the  weight  F,  we  shall  have : 
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V     _[„] 

VO   ^   \r.,\ 

VX  -  \vx] 

VZ  =  [vz\  , 
and  as 

0  = 


I  his  normal  place  will  exhaust  the  normal  equation  (117)  corresponding  to  the  constant 
l  cnii,  both  with  respect  to  mean  value  and  mean  error.  But  if  we  make  the  other  normal 
equations  free  of  (117),  we  get,  by  the  correct  method  of  least  squares: 

\v(o  —  0)(x—X)~\  =-  b[v(x—  .X)2]  +  •  •  •  +  d\v(x—  X)(z—  Z)}   I 

...................................................  (H8a) 

[v(o  —  0)(z-Z)]  =  b[v(x—  X)(z  —  Z)]  +  ...+d[v(z-Z)*\    I 

for  the  determination  of  the  elements  b  .  .  .  d,  and  these  determinations  are  lost  completely 
if  the  whole  series  is  gathered  into  a  single  normal  place.  Certainly,  the  coefficients  of  these 
equations  (118  a)  are  small  quantities  of  the  second  order,  if  the  x—  X  and  z  —  Z  are 
small  of  the  first  order. 

If,  on  the  other  hand,  we  split  up  the  series,  forming  for  each  part  a  normal 
place,  and  adjusting  these  normal  places  instead  of  the  observations  according  to  the 
method  of  the  least  squares,  then  the  normal  equation  corresponding  to  the  constant 
term  is  still  exhausted  by  the  normal  place  method;  and  besides  this  determination  of 
a-{-bX-{-  .  .  .  -\-dZ  the  normal  place  method  now  also  affords  a  determination  of  the  other 
elements  b  .  .  .  d,  in  such  a  way,  however,  that  we  suffer  a  loss  of  the  weights  for  their 
determination.  This  loss  can  become  great,  nay  total,  if  the  normal  places  are  selected  in 
a  way  that  does  not  suit  the  purpose;  but  it  can  be  made  rather  insignificant  by  a 
suitable  selection  of  normal  places  in  not  too  small  a  number. 

Let  us  suppose,  in  order  to  simplify  matters,  that  the  observations  have  only  one 
variable  essential  circumstance  x,  of  which  their  mean  values  are  linear  functions,  con- 
sequently 

^i(°)  =  a  -\-bXi 

and  that  the  xs  are  uniformly  distributed  within  the  utmost  limits,  x0  and  #,  ;  we  then  let  each 
normal  place  encompass  an  equally  large  part  of  this  interval,  and  we  shall  find  then,  this 

being  the  most  favourable  case,  with  n  normal  places,  that  the  weight  on  the  adjusted  value  of 

(l  \2 
—  1  ,  if  by  a  correct  adjustment  by  elements  the  corresponding 
ft  I 

weight  is  taken  as  unity.  The  loss  is  thus,  at  any  rate,  not  very  great.  And  it  can  be 
made  still  smaller,  if  the  distribution  of  the  essential  circumstance  of  the  observations  is 

14* 
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uneven,  and  if  we  can   get  a  normal   place    everywhere  where    the    observations    become 
particularly  frequent,  while  empty  spaces  separate  the  normal  places  from  each  other. 

The  case  is  analogous  also  when  the  observations  are  still  functions  of  a  single 
or  a  few  essential  circumstances,  but  the  function  is  of  a  higher  degree,  or  transcendental. 
For  it  is  possible  also  to  form  normal  places  in  these  cases;  and  we  can  do  so  not  only 
when  the  variations  of  the  circumstances  can  be  directly  treated  as  infinitely  small  within 
each  normal  place,  which  case  by  Taylor's  theorem  falls  within  the  given  rule.  For  if  we 
have  at  our  disposal  a  provisional  approximate  formula,  y  =  f(x),  and  have  calculated  the 
deviation  from  this,  o  — ?/,  of  every  observation  (considering  the  deviations  as  observations 
with  the  essential  circumstances  and  mean  errors  of  the  original  observations),  then  we 
can  use  mean  numbers  of  deviations  for  reciprocally  adjacent  circumstances  as  corrections 
which,  added  to  the  corresponding  values  from  the  approximate  formula,  give  the  normal 
values.  Further,  it  is  required  here  only  that  no  normal  place  is  made  so  comprehensive 
that  the  deviations  within  its  limits  do  not  remain  linear  functions  of  the  essential 
circumstances. 

Also  here  part  of  the  correctness  is  lost,  and  it  is  difficult  to  say  how  much.  The 
loss  is,  under  equal  circumstances,  smaller,  the  more  normal  places  we  form.  With  twice 
(or  three  times)  as  many  normal  places  as  the  number  of  the  unknown  elements  of  the 
problem,  it  will  rarely  become  perceptible.  With  due  regard  to  the  essential  circumstances 
and  the  distribution  of  the  weights  we  can  reduce  it,  using  empty  spaces  as  boundaries 
between  the  normal  places. 

A  suitable  distribution  of  the  normal  places  also  depends  on  what  function  the 
observations  are  of  their  essential  circumstances.  As  to  this,  however,  it  is,  as  a  rule, 
sufficient  to  know  the  behaviour  of  the  integral  algebraic  functions,  as  we  generally,  when 
we  have  to  do  with  functions  which  are  essentially  different  from  these,  will  try  through 
transformations  of  the  variables  to  get  back  to  them  and  to  certain  functions  which 
resemble  them  in  this  respect. 

We  need  only  consider  the  cases  in  which  we  have  only  one  variable  essential 
circumstance,  of  which  the  mean  value  of  the  observation  is  an  algebraic  function  of  the 
rih  degree.  We  are  able  then,  on  any  supposition  as  to  the  distribution  of  the  observations, 
o,  and  their  essential  circumstances,  x,  and  weights,  v,  to  determine  r-j-1  substitutive 
observations,  0,  together  with  the  essential  circumstances,  X,  and  weights,  F,  belonging 
to  them,  in  such  a  way  that  they  treated  according  to  the  method  of  the  least  squares 
will  give  the  same  results  as  the  larger  number  of  actual  observations.  The  conditions  are : 

[ov]      —  00F0+  ...+OrVr  \ 
J».  >«'!«» (119) 

[OX'V]    =    XIO,  Fo  +   .  .  .   -f 


and 

N 

(120) 

These  3r+2  equations  arc  not  quite  sufficient  for  the  determination  of  the  3r-4-3 
unknowns.     We  remove  the  difficulty  in  the  best  way  by  adding  the  equation: 


I  ,•<•» 

The  elimination  of  the  F's  (and  O's)  then  leads  to  an  equation  of  the  r+1  degree,  whose 
roots  X0,  .  .  .  Xr  are  all  real  quantities,  if  the  given  #'s  have  been  real  and  the  t>'s 
positive.  When  the  roots  are  found,  we  can  compute,  first  F0,  ...  Vr  and  afterwards 
00,  ...  Or,  by  means  of  two  systems  of  r  +  1  linear  equations  with  r  4-  1  unknowns. 

If,  for  instance,  the  essential  circumstances  of  the  actual  observations  are  contained 
in  the  interval  from  —  1  to  +  1,  and  if  the  observations  are  so  numerous  and  so  equally 
distributed  that  they  may  be  looked  upon  as  continuous  with  constant  mean  error  every- 
where in  this  interval;  if,  further,  the  sum  of  the  weights  =  2;  then  the  distribution  of 
the  substitutive  observations  will  be  symmetrical  around  0,  and,  for  functions  of  the  lowest 
degrees,  be 


X  = 

•000 

V  = 

2-000  ; 

X  = 

-  -577, 

+  •577 

V  = 

1-000, 

1-000  ; 

JL    = 

-  -775, 

•000, 

+  •775 

V  -- 

•556, 

•889, 

•556 

X  = 

-  -861, 

-  -340, 

+  '340, 

F  = 

•348, 

•652, 

•652, 

X  = 

-  -906, 

-  -538, 

•000, 

F  = 

•237, 

•479, 

•569, 

Y    — i 


r  = 


+  •861 

•348  ; 

...          _..,          ___,     +-538,  +-906 

•479,         -237  ' 

\X  -       -  -932,      -  -661,      -  -239,     +  -239,  +  -661,      -  -932 
{  F  =          -171,         -361,         -468,         -468,         -361,         -171  ; 

}  X  -       -  -949,      -  -742,      -  -406,         -000,  +  -406,     +  -742,     +  -9 
6  i  V  -          -129,         -280,         -382,         -418,         -382,         -280,         -129  ' 

If,  in  another  example,  the  distribution  of  the  observations  is,  likewise,  continuous, 
but  the  weights  within  the  element  dx  proportional  to  e~^x\  consequently  symmetrical  with 
maximum  by  x  =  0,  then  the  distribution  for  the  lowest  degrees,  the  only  ones  of  any 
practical  interest,  will  be 
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\X  -  -000 

0  I  F  =      2-000  ; 

I  x  =  —i-ooo,   +1-000 

1  (  F  =       1-000,        1-000  ; 

4  \X  =  —1-732,          -000,     +1-732 

2  |  F  =        -333,        1-333,          -333  ; 

fX  =     -2-334,       -  -742,     +  -742,     +2-334 

3  \  F  =        -092,          -908,          -908,          -092  ; 

\X  =     -2-857,     -1-356,          -000,     +1-356,     +2-857 
|  F  =        -023,          -444,        1-067,          -444,          -023  ; 

\X  =     —3-324,     -1-889,       -  -617,     +  -617,     +1-889,     +3-324 
5 1 F  =        -005,          -177,          -818,          -818,          -177,          -005  ; 

\X  =  —3-750,     —2-367,     -1-154,          -000,    +1-154,     +2-367,      3-750 
6  {  F  =        -001,          -062,          -480,          -914,          -480,          -062,        -001  ' 

If  we  were  able  now  to  represent  these  substitutive  observations  as  normal  places, 
then  we  should  be  able  also,  by  the  use  of  such  tables  in  analogous  cases,  to  prevent  any 
loss  of  exactness.  It  would  be  possible  entirely  to  evade  the  application  of  the  method  of 
the  least  squares;  we  had  but  to  form  such  qualified  normal  places  in  just  the  same 
number  as  the  adjustment  formula  contains  elements  that  are  to  be  determined.  This, 
however,  is  not  possible.  Certainly,  we  can  obtain  normal  places  corresponding  to  the 
required  values  of  the  essential  circumstance,  but  we  cannot  by  a  simple  formation  of 
mean  numbers  give  them  the  weight  which  each  of  them  ought  to  have,  without  employing 
some  of  the  observations  twice,  others  not  at  all.  By  taking  into  consideration  how  much 
the  extreme  normal  places  from  this  reason  must  lose  in  weight,  compared  to  the  sub- 
stitutive observations,  we  can  estimate  how  many  per  cent  the  loss,  in  the  worst  case, 
can  amount  to.  In  the  first  of  our  examples  we  find  the  loss  to  be  0,  for  r  =  0  and 
r  =  1;  but  for  r  =  2  we  lose  15,  for  r  =  3  we  lose  19,  for  r  =  4  we  lose  20,  and 
for  greater  values  of  r  21  p.  c. 

Example.  Eighteen  unbound  observations,  equally  good,  Xt(o)  =  ^,  correspond 
to  an  essential  circumstance  whose  values  are  distributed  as  the  prime  numbers  p  from 
1049  to  1171.  Taking  (p— 1105) :  100  =  x  as  the  essential  circumstance  of  the  observa- 
tion o,  we  have: 
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X                 0 

-  -51)      -  -41 
-•54    +  -50 

X                 0 

-  -14      -  -15 
-  -12      -  -32 

—  -08     -f-  -33 
-  -02      -  -21 

X                 0 

+  -18  -  -24 
+•24  +-09 

-  -44      -  -03 
-  -42      -  -15 
-•36    +-48 

+  •46  +-39 

+  •48  +-12 

+  •04    -f-21 

+  •12    +-40 

+  -58  -  -24 
+  -66  -  -39 

—  •18    +  -18 

Dividing  these  observations  into  groups  indicated  by  the  horizontal  lines,  we  get 

the  6  normal  places: 

x  o 

-  -550  +  -045 

-  -407  +  -100 

-  -108  -  -034 
+  •145  +-115 
+  -470  +  -255 


f -620 


weight 
2 
3 
5 
4 
2 
2 


-•315 

If  we  suppose  the  mean  values  of  the  observations  to  be  a  function  of  the  third, 
eventually  second,  degree  of  or,  ^(o)  =  a-\-bx-\-cxz-\-dxB,  we  have  by  ordinary  application 
of  the  adjustment  by  elements  the  normal  equations: 

6-72  =  216-00  a  -     1-20  b  +  29-98  c  +  1-94  d 
-3-07  =  -1-20  a  +  29-98  6  +    1-94  c  + 8- 
-1-08=    29-98«  +    1-946+ 
-1-44=       1-94  a  +    8-1U  + 

By  the  free  equations: 

6-72  =  216-00 a  -     1-206  +  29-98 c  +  1-94 d 
-  3-03  =  29-97  b  +  2-11  c  +  $-12d 

-1-79  =  3-80  c  +    -37  d 

-   -54  =  -305  d 

we  get: 

a  =  +  -09,     a'  =  +-10, 

b  =,  +  -40,  b'  =  —  -07, 
c  =-  --  -30,  c'  =  —-47, 
d  =  —1-77, 

where  a',  6',  c'  are  the  coefficients  in  the  functions   of  second  degree,  obtained   by   pre- 
supposing d  =  0. 
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Now,  by  application   of  the  normal  places  instead  of  the  original  observations,  we 
obtain  on  the  same  suppositions  the  normal  equations: 

6-72  =  =  216-00  a  -     1-20  b  +  29-45  c  +  1-87  d 

_2-84  =  —1.20  a  +  29-45  b  +    1-87  c  +  7-93  d 

-   -54=       29-45  a  +    1-876+    7-93  c  +  M4  d 

-1-57  =      1-87  a  +    7-936+    1-14  c  +  2-45  d. 

By  the  free  equations: 

6-72  ==  216-00  a  -     1-20  6  +  29-45  c  +  1-87  d 

-  2-80  =  29-44  6  +    2-03  c  +  7-94  d 
-1-26  =  3-77  c+    -34  d 

-  -76  =  -263d, 
we  get: 


a  =  +  -07, 

a'  =  +  -08, 

b  =  +  -69, 

6'  =  --07, 

c  =  --  -07, 

c'  =  -  -33, 

d  =  —2-88. 

A  comparison  between  these  two  calculations,  particularly  between  the  leading 
coefficients  in  the  free  equations,  shows  that  the  loss  of  weight  amounts  to  1  -  jj|-,  or 
14  per  cent.  But  it  is  only  in  the  equation  for  d  that  the  loss  is  so  great;  in  the  equa- 
tions for  b  and  c,  respectively,  it  is  only  two  and  one  per  cent. 

Our  normal  places  are  very  good  if  the  function  is  only  of  the  first  or  second 
degree;  for  the  function  of  third  degree  they  can  be  admitted  even  though  the  values  of 
the  elements  a,  b,  c,  d  have  changed  considerably.  For  functions  of  4th  or  higher  degrees 
these  normal  places  would  prove  insufficient. 

§  64.  That  graphical  adjustment  is  a  means  which  can  carry  us  through  great 
difficulties,  we  have  shown  already  in  practice  by  applying  it  to  the  drawing  of  curves  of 
errors.  The  remarkable  powers  of  the  eye  and  the  hand  must,  like  a  deus  ex  machina, 
help  us  where  all  other  means  fail. 

Adjustment  by  drawing  is  restricted  only  by  one  single  condition:  if  we  are  to 
represent  a  relation  between  quantities  by  a  plane  curve,  there  must  be  only  two  quantities; 
one  of  these,  represented  by  the  ordinate,  is,  or  is  considered  to  be,  the  observed  value; 
and  the  other,  represented  by  the  abscissa,  is  considered  the  only  essential  circumstance 
on  which  the  observed  value  depends. 

Examples  of  graphical  adjustment  with  two  essential  circumstances  do  occur, 
however,  for  instance  in  weather-charts.  In  periodic  phenomena  polar  co-ordinates  are 
preferred.  But  otherwise  each  observation  is  represented  by  a  point  whose  ordinate  and 


n:; 

abscissa  are,  respectively,  the  observed  value  and  its  essential  circumstance;  and  the  adjust- 
ment is  performed  by  free-hand  drawing  of  a  curve  which  satisfies  the  two  conditions 
of  being  free  from  irregularities  and  going  as  near  as  possible  to  the  several  points  of 
observation.  The  smoothness  of  the  curve  in  this  process  plays  the  part  of  the  theory, 
and  it  is  a  matter  of  course  that  we  succeed  relatively  best  when  the  theory  is  unknown 
or  extremely  intricate ;  when,  for  instance,  we  must  confine  ourselves  to  requiring  that  the 
phenomenon  must  be  continuous  within  the  observed  region,  or  be  a  single  valued  function. 
But  also  such  a  theoretical  condition  as,  for  instance,  the  one  that  the  law  of  dependence 
must  be  of  an  integral,  rational  form,  may  be  successfully  represented  by  graphical  adjust- 
ment, if  the  operator  has  had  practice  in  the  drawing  of  parabolas  of  higher  degrees.  And 
we  have  seen  that  also  such  functional  forms  as  have  the  rapid  approximation  to  an  asymptote 
which  the  curves  of  error  demand,  lie  within  the  province  of  the  graphical  adjustment. 

As  for  the  approximation  to  the  several  observed  points,  the  idea  of  the  adjust- 
ment implies  that  a  perfect  identity  is  not  necessary;  only,  the  curve  must  intersect  the 
ordinates  so  near  the  points  as  is  required  by  the  several  mean  errors  or  laws  of  errors. 
If,  after  all,  we  know  anything  as  to  the  exactness  of  the  several  observations  before  we 
make  the  adjustment,  this  ought  to  be  indicated  visibly  on  the  drawing-paper  and  used 
in  the  graphical  adjustment.  We  cannot  pay  much  regard,  of  course,  to  the  presupposed 
typical  form  and  other  properties  of  the  law  of  errors,  but  something  may  be  attained, 
particularly  with  regard  to  the  number  of  similar  deviations. 

If  we  know  nothing  whatever  as  to  the  exactness  of  the  several  observations,  or 
only  that  they  are  all  to  be  considered  equally  good,  there  can  be  only  a  single  point  in 
our  figure  for  each  observation.  In  a  graphical  adjustment,  however,  we  can  and  ought 
to  take  care  that  the  curve  we  draw  has  the  same  number  of  observed  points  on  each 
side  of  it,  not  only  in  its  whole  extent,  but  also  as  far  as  possible  for  arbitrary  divisions. 
If  we  know  the  weights  of  the  observations,  they  may  be  indicated  on  the  drawing,  and 
observations  with  the  weight  n  count  w-fold. 

In  contradistinction  to  this  it  is  worth  while  to  remark  that,  with  the  exception 
only  of  .bonds  between  observations,  represented  by  different  points,  it  is  possible  to  lay 
down  on  the  paper  of  adjustment  almost  all  desirable  information  about  the  several  laws  of 
errors.  Around  each  point  whose  co-ordinates  represent  the  mean  values  of  an  observation 
and  of  its  essential  circumstance,  a  curve,  the  curve  of  mean  errors,  may  be  drawn  in 
such  a  way  that  a  real  intersection  of  it  with  any  curve  of  adjustment  indicates  a  devia- 
tion less  than  the  mean  error  resulting  from  the  combination  of  the  mean  errors  of  the 
observed  value  and  that  of  its  essential  circumstance,  if  this  is  also  found  by  observation, 
while  a  passing  over  or  under  indicates  a  deviation  exceeding  the  mean  error.  Evidently, 
drawings  furnished  with  such  indications  enable  us  to  make  very  good  adjustments. 

15 
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If  the  laws  of  errors  both  for  the  observation  and  for  its  circumstance  are  typical, 
then  the  curve  of  mean  errors  is  an  ellipse  with  the  observed  points  in  its  centre. 

If,  further,  there  are  no  bonds  between  the  observation  and  its  circumstance,  then 
the  ellipse  of  mean  errors  has  its  axes  parallel  to  the  ordinate  and  the  abscissa,  and  their 
lengths  are  double  the  respective  mean  errors. 

If  the  essential  circumstance  of  the  observation,  the  abscissa,  is  known  to  be  free 
of  errors,  the  ellipse  of  the  mean  errors  is  reduced  to  the  two  points  on  the  ordinate, 
distant  by  the  mean  error  of  the  observation  from  the  central  point  of  observation.  In 
special  cases  other  means  of  illustrating  the  laws  of  errors  may  be  used.  If,  for  instance, 
the  mean  errors  as  well  as  the  mean  values  .are  continuous  functions  of  the  essential 
circumstance  of  the  observation,  continuous  curves  for  the  mean  errors  may  be  drawn  on 
the  adjustment  paper. 

The  principal  advantages  of  the  graphical  adjustment  are  its  indication  of  gross 
errors  and  its  independence  of  a  definitely  formulated  theory.  By  measuring  the  ordinates 
of  the  adjusted  curve  we  can  get  improved  observations  corresponding  to  as  many  values 
of  the  circumstance  or  abscissa  as  we  wish,  and  we  can  select  them  as  we  please  within 
the  limits  of  the  drawing.  But  these  adjusted  observations  are  strongly  bound  together, 
and  we  have  no  indication  whatever  of  their  mean  errors.  Consequently,  no  other  adjust- 
ment can  be  based  immediately  upon  the  results  of  a  graphical  adjustment. 

On  the  other  hand,  graphical  adjustment  can  be  very  advantageously  combined 
with  interpolations,  both  preceding  and  following,  and  we  shall  see  later  on  that  by  this 
means  we  can  remedy  its  defects,  particularly  its  limited  accuracy  and  its  tendency  to 
place  too  much  confidence  in  the  observations,  and  too  little  in  the  theory,  i.  e.  to  give 
an  under-adjustment. 

By  drawing  we  attain  an  exactness  of  only  3  or  4  significant  figures,  and  that  is 
frequently  insufficient.  The  scale  of  the  drawing  must  be  chosen  in  such  a  way  that  the 
errors  of  observations  are  visible;  but  then  the  dimensions  may  easily  become  so  large  that 
no  paper  can  contain  the  drawing.  In  order  to  give  the  eye  a  full  grasp  of  the  figure, 
the  latter  must  in  its  whole  course  show  only  small  deviations  from  the  straight  line,  which 
is  taken  as  the  axis  of  abscissae.  This  is  a  practical  hint,  founded  upon  experience.  The 
eye  can  judge  of  the  smoothness  of  other  curves  also,  but  not  by  far  so  well  as  of  that 
of  a  straight  line.  And  if  the  line  forms  a  large  angle  with  the  axis  of  the  abscissae, 
then  the  exactness  is  lost  by  the  flat  intersections  with  the  ordinates.  Therefore,  as  a  rule, 
it  is  not  the  original  observations  that  are  marked  on  the  paper  when  we  make  a  graphical 
adjustment,  but  only  their  differences  from  values  found  by  a  preceding  interpolation. 

In  order  to  avoid  an  under-adjustment,  we  must  allow  |  of  the  deviations  of  the 
curve  from  the  observation-points  to  surpass  the  mean  errors.  It  is  further  essential  that 
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the  said  interpolation  is  based  on  a  minimum  number  of  observed  data;  and  after  the 
graphical  adjustment  has  been  made,  it  is  safe  to  try  another  interpolation  using  a  smaller 
number  of  the  adjusted  values  as  the  base  of  a  new  interpolation  and  a  repeated  graphical 
adjustment. 

If  the  results  of  a  graphical  adjustment  are  required  only  in  the  form  of  a  table 
representing  the  adjusted  observations  as  a  function  of  the  circumstance  as  argument,  this 
table  also  ought  to  be  based  on  an  interpolation  between  relatively  few  measured  values, 
the  interpolated  values  being  checked  by  comparison  with  the  corresponding  measured 
values.  A  table  of  exclusively  measured  values  will  show  too  irregular  differences. 

When  we  have  corrected  these  values  by  measuring  the  ordinates  in  a  curve  of 
graphical  adjustment,  they  may  be  employed  instead  of  the  observations  as  a  sort  of  normal 
places.  It  has  been  said,  however,  and  it  deserves  to  be  repeated,  that  they  must  not  be 
adjusted  by  means  of  the  method  of  the  least  squares,  like  the  normal  places  properly  so 
called.  But  we  can  very  well  use  both  sorts  of  normal  places,  in  a  just  sufficient  number, 
for  the  computation  of  the  unknown  elements  of  the  problem,  according  to  the  rules  of 
exact  mathematics. 

That  we  do  not  know  their  weights,  and  that  there  are  bonds  between  them,  will  not 
here  injure  the  graphically  determined  normal  places.  The  very  circumstance  that  even  distant 
observations  by  the  construction  of  the  curve  are  made  to  influence  each  normal  place,  is  an 
advantage.  It  is  not  necessary  here  to  suffer  any  loss  of  exactness,  as  by  the  other  normal 
places,  which,  as  they  are  to  be  represented  as  mean  numbers,  cannot  at  the  same  time  be 
put  in  the  most  advantageous  places  and  obtain  the  due  weight.  As  to  the  rest,  however,  what 
has  been  said  p.  108—  110  about  the  necessity  of  putting  the  substitutive  observations  in 
the  right  place,  holds  good  also,  without  any  alteration,  of  the  graphical  normal  places. 

The  method  of  the  graphical  adjustment  enables  us  to  execute  the  drawing  with 
absolute  correctness,  and  it  leaves  us  full  liberty  'to  put  the  normal  places  where  we  like, 
consequently  also  in  the  places  required  for  absolute  correctness  ;  but  in  both  these  respects 
it  leaves  everything  to  our  tact  and  practice,  and  gives  no  formal  help  to  it. 

As  to  the  criticism,  the  graphical  adjustment  gives  no  information  about  the  mean 
errors  of  its  results.  But,  if  we  can  state  the  mean  error  of  each  observation,  we  are  able, 
nevertheless,  to  subject  the  graphical  adjustments  to  a  summary  criticism,  according  to 
the  rule 


=  n  —  m. 


And  with  respect  to  the  more  special  criticism  on  systematical  deviations,  the  graphical 
method  even  takes  a  very  high  rank.  Through  graphical  representations  of  the  finally 
remaining  deviations,  o  —  w,  particularly  if  we  can  also  lay  down  the  mean  errors  on  the 
same  drawing,  we  get  the  sharpest  check  on  the  objective  correctness  of  any  adjustment. 

15* 
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From  this  reason,  and  owing  to  the  proportionally  slight  difficulties  attached  to  it, 
the  graphical  adjustment  becomes  particularly  suitable  where  we  are  to  lay  down  new 
empirical  laws.  In  such  cases  we  have  to  work  through,  to  check,  and  to  reject  series 
of  hypotheses  as  to  the  functional  interdependency  of  observations  and  their  essential 
circumstances.  We  save  much  labour,  and'  illustrate  our  results,  if  we  work  by  graphical 
adjustment. 

Of  course,  we  are  not  obliged  to  subject  observations  to  adjustment.  In  the  pre- 
liminary stages,  or  as  long  as  it  is  doubtful  whether  a  greater  number  of  essential  circum- 
stances ought  not  to  be  taken  into  consideration,  it  may  even  be  the  best  thing  to  give 
the  observations  just  as  they  are. 

But  if  we  use  the  graphical  form  in  order  to  illustrate  such  statements  by  the 
drawing  of  a  line  which  connects  the  several  observed  points,  then  we  ought  to  give  this 
line  the  form  of  a  continuous  curve  and  not,  according  to  a  fashion  which  unfortunately 
is  widely  spread,  the  form  of  a  rectilinear  polygon  which  is  broken  in  every  observed 
point.  Discontinuity  in  the  curve  is  such  a  marked  geometrical  peculiarity  that  it  ought, 
even  more  than  cusps,  double-points,  and  asymptotes,  to  be  reserved  for  those  cases  in 
which  the  author  expressly  wants  to  give  his  opinion  on  its  occurrence  in  reality. 


XIV.     THE  THEORY  OF  PROBABILITY. 

§  65.  We  have  already,  in  §  9,  defined  "probability"  as  the  limit  to  which  —  the 
law  of  the  large  numbers  taken  for  granted  —  the  relative  frequency  of  an  event  approaches, 
when  the  number  of  repetitions  is  increasing  indefinitely;  or  in  other  words,  as  the  limit 
of  the  ratio  of  the  number  of  favourable  events  to  the  total  number  of  trials. 

The  theory  of  probabilities  treats  especially  of  such  observations  whose  events 
cannot  be  naturally  or  immediately  expressed  in  numbers.  But  there  is  no  compulsion  in 
this  limitation.  When  an  observation  can  result  in  different  numerical  values,  then  for 
each  of  these  events  we  may  very  well  speak  of  its  probability,  imagining  as  the  opposite 
event  all  the  other  possible  ones.  In  this  way  the  theory  of  probabilities  has  served  as 
the  constant  foundation  of  the  theory  of  observation  as  a  whole. 

But,  on  the  other  hand,  it  is  important  to  notice  that  the  determination  of  the 
law  of  errors  by  symmetrical  functions  may  also  be  employed  in  the  non-numerical  cases 
without  the  intervention  of  the  notion  of  probability.  For  as  we  can  always  indicate  the 
mutually  complementary  opposite  events  as  the  "fortunate"  or  "unfortunate"  one,  or  as 
"Yes"  and  "No",  we  may  also  use  the  numbers  0  and  1  as  such  a  formal  indication.  If 
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then  we  identify  1  with  the  favourable  "Yes"-event,  0  with  the  unfavourable  "No",  the 
sums  of  the  numbers  got  in  a  series  of  repetitions  will  give  the  frequency  of  affirmative 
events.  This  relation,  which  has  been  used  already  in  some  of  the  foregoing  examples,  we 
must  here  consider  more  explicitly. 

If  repetitions  of  the  same  observation,  which  admits  of  only  two  alternatives,  give 
the  result  "Yes"  =  1  m  times,    against  n  times  "No11  ==»  0,   then  the  relative  frequency 


for  the  favourable  event  is 


m-f-n 


But  if  we  employ  the  form  of  the  symmetrical  functions 


for  the  same  law  of  actual  errors,  then  the  sums  of  the  powers  are 

30  =  m-\-n  ,     sl  =  sz    ....   =  sr  =  m.  (121) 

In  order  to  determine  the  half-invariants  by  means  of  this,  we  solve  the  equations 


m  — 

m  = 

m  = 

m  = 


3m  . 


-f 
-f 


3m  •  ^8  -f  (m  -f  n)/jtt 


and  find  then 


m-\-n 
mn 

(w-f  w)2 
mn  (n  —  m) 
(m  -f-  n)3 
mn  (w2  —  4mn  -f-  m2) 


(122) 


(ro-f  n)* 
Compare  §  23,  example  2,  and  §  24,  example  3. 

All  the  half-invariants   are  integral  functions   of  the  relative  frequency,  which  is 


itself  equal  to 


The  relative  frequency  of  the  opposite  result  is 


w-j-w 


1— 


interchanging  m  and  w,  none  of  the  half-invariants  of  even  degree  are  changed,  and  those 
of  odd  degree  (from  p3  upwards)  only  change  their  signs. 

In    order   to   represent  the  connection  between  the  laws  of  presumptive  errors,  we 
need  only  assume,  in  (122),  that  m  and  n  increase  indefinitely,  while  the  probability  of  the 


event  becomes  p  = 


and  the  probability  of  the  opposite  event  is  represented  by 


m-\-n 
q.    The  half  invariants  are  then: 

*i  =  P 
**  =  P<1 


(123) 


Our  mean  values  are  therefore,  respectively,  the  relative  frequency  and  the  probability  itself. 
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We  must  now  first  notice  here  that  every  half-invariant  is  its  own  fixed  and 
simple  function  of  the  probability  (the  frequency).  When  a  result  of  observation  can  be 
stated  in  the  form  of  one  single  probability,  properly  so  called,  we  have  thereby  given  as 
complete  a  determination  of  the  law  of  errors  as  by  the  whole  series  of  half-invariants. 
In  such  cases  it  is  simpler  to  employ  the  theory  of  probability  instead  of  the  symmetrical 
functions  and  the  method  of  the  least  squares. 

The  theory  of  probability  thereby  gets  its  province  determined  in  a  much  more 
natural  and  suitable  way  than  that  employed  in  the  beginning  of  this  paragraph. 

But  at  the  same  time  we  see  that  the  form  of  the  half-invariants  is  not  only  the 
general  means  which  must  be  employed  where  the  conditions  for  the  use  of  the  probability 
are  not  fulfilled,  but  also  that,  within  the  theory  of  probability  itself,  we  shall  require, 
particularly,  the  notion  of  the  mean  error. 

Even  where  the  probability  can  replace  all  the  half-invariants,  we  shall  require  all 
the  various  sides  of  the  notions  which  are  distinctly  expressed  in  the  half-invariants.  Now 
we  have  particularly  to  consider  the  probability  as  the  definite  mean  value,  now  the  point 
is  to  elicit  the  definite  degree  of  uncertainty  which  is  implied  in  the  probability,  and 
which  is  particularly  emphasised  in  the  mean  error.  Otherwise,  we  should  constantly  be 
tempted  to  rely  on  the  predictions  of  the  theory  of  probability  to  an  extent  far  beyond 
what  is  justly  due  to  them.  Finally,  we  shall  see  immediately  that  the  laws  of  error  of 
the  probabilities  are  far  from  typical,  but  that  they  have  rather  a  type  of  their  own,  which 
must  sometimes  be  especially  emphasised. 

All  this  we  shall  be  able  to  do  here,  where  we  have  the  half-invariants  in  reserve 
as  a  means  of  representing  the  theory  of  probability. 

§  66.  In  particular,  we  can  now,  though  only  in  the  form  of  the  half-invariants, 
solve  one  of  the  principal  problems  of  the  theory  of  probability,  and  determine  the  law  of 
presumptive  errors  for  the  frequency  m  of  one  of  the  events  of  a  trial,  which  can  have 
only  two  events  and  which  is  repeated  .ZV  times,  upon  the  supposition  that  the  trial  follows 
the  law  of  the  large  numbers,  and  that  the  probability  p  for  a  single  trial  is  known. 

The  equations  (123)  give  us  already  the  corresponding  law  of  error  for  each  trial, 
and  as  the  total  absolute  frequency  is  the  sum  of  the  partial  ones,  we  need  only  use  the 
equations  (35)  to  find: 


Npq  =•  Np(l—p) 

Npq(q-p)  =  Np(l-p)(l-2p) 


(124) 


11!) 

The  ratio  of  the  mean  frequency  to  the  number  of  trials  is  therefore  the  probability  itself. 
When  p  is  small  the  mean  error  differs  little  from  the  square  root  VNp  of  the  mean 
frequency;  and  if  p  is  nearly  =  1,  the  mean  error  of  the  opposite  event  is  nearly  equal  to 
VN<j.  When  the  probability,  p,  is  nearly  equal  to  |,  the  mean  error  will  be  about  \  \/N. 

The  law  of  error  is  not  strictly  typical,  although  the  rational  function  of  the  rth 
degree  in  kr(m)  vanishes  for  r  different  values  of  p  between  0  and  1,  the  limits  included, 
so  that  the  deviation  from  the  typical  form  must,  on  the  whole,  be  small.  If,  however,  we 
consider  the  relative  magnitude  of  the  higher  half-invariants  as  compared  with  the  powers 
of  the  mean  error 

I-  _      1—P 


and 


VNpq 


(125) 


Npq 

the  occurence  of  Npq  in  the  denominators  of  the  abridged  fractions  shows,  not  only  that 
great  numbers  of  repetitions,  here  as  always,  cause  an  approximation  to  the  typical  form, 
but  also  that,  in  contrast  to  this,  the  law  of  error  in  the  cases  of  certainty  and  impossi- 
bility, when  2  =  0  and  p  =  0,  becomes  skew  and  deviates  from  the  typical  in  an  infinitely 
high  degree,  while  at  the  same  time  the  square  of  the  mean  errors  becomes  =  0.  This 
remarkable  property  is  still  traceable  in  the  cases  in  which  the  probability  is  either  very 
small  or  very  nearly  equal  to  1.  In  a  hundred  trials  with  the  probability  =  99£  per  ct. 
the  mean  error  will  be  about  =  ]/±.  Errors  beyond  the  mean  frequency  99i  cannot 
exceed  ^,  and  are  therefore  less  than  the  mean  error.  The  great  diminishing  errors  must 
therefore  be  more  frequent  than  in  typical  cases,  and  frequencies  of  97  or  96  will  not  be 
rare  in  the  case  under  consideration,  though  hey  must  be  fully  counter-balanced  by 
numerous  cases  of  100  per  ct.  The  law  of  error  is  consequently  skew  in  a  perceptible 
degree.  In  applications  of  adjustment  to  problems  of  probability,  it  is,  from  this  reason, 
frequently  necessary  to  reject  extreme  probabilities. 


XV.  THE  FORMAL  THEORY  OF  PROBABILITY. 

§  67.  The  formal  theory  of  probability  teaches  us  how  to  determine  probabilities 
that  depend  upon  other  probabilities,  which  are  supposed  to  be  given.  Of  course,  there 
are  no  mathematical  rules  specially  applicable  to  computations  that  deal  with  probabilities, 
and  there  are  many  computations  with  probabilities  which  do  not  fall  under  the  theory  of 
probability,  for  instance,  adjustments  of  probabilities.  But  in  view  of  the  direct  application 
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of  probabilities,  not  only  to  games,  insurances,  and  statistics,  but  to  all  conditions  of 
life,  it  will  be  understood  that  special  importance  attaches  to  the  marks  which  show 
that  a  computation  will  lead  us  to  a  probability  as  its  result,  as  this  implies  in  part  or 
in  the  whole  a  determination  of  a  law  of  errors.  The  formal  theory  of  probabilities  rests 
on  two  theorems,  one  concerning  the  addition  of  probabilities,  the  other  concerning  their 
multiplication. 

I.  The  theorem  concerning  the  addition  of  probabilities  can,    as  all  probabilities 
are  positive  numbers,  be  deduced  from  the  usual  definition  of  addition  as  a  putting  together: 
if  a  sum   of  probabilities  is  to   be  a  probability  itself,   we  must  be  allowed  to  look  upon 
each   of  the  probabilities  that  we  are   to   add   together  as  corresponding  to  its  particular 
events.    These  events  must  mutually  exclude  one  another,  but  must  at  the  same  time  have 
a  quality  in  common,  to  which,  after  the  addition,  our  whole  attention  must  be  given.    If 
the  sum  is  to  be  the  correct  probability  of  events  with  this  quality,  the  same  quality  must 
be  found  in  no  other  event  of  the  trial.   An  "either — or"  is,  therefore,  the  simple  gramma- 
tical mark  of  the  addition  of  probabilities.    The  event  Es ,  whose  probability  is  p1+^>2' 
must  occur,  if  either  the  result  £"x,  whose  probability  is  p^  or  the  quite  different  event  Ez, 
whose  probability  is  ^?2,  occurs,  and  not  in  any  other  case.    If  we  require  no  other  resem- 
blance  between  the  events   whose  probabilities  are  added  together,   than  that  they  belong 
to  the  same  trial,   their  sum  must  be  the  probability  1,  certainty,  because  then  all  events 
of  the  trial  are  favourable.     If  p  be  the  probability  for  a  certain  event,  q  the  probability 
against  the  same,  then  we  have  p-\-q  =  1,  g  =  1 — p>    If  n  events  of  the  same  trial  be 
equally  probable,  the  probability  of  each   being  =  p,   then   the  aggregate  probability  of 
these  events  is  =  np. 

II.  The  theorem  concerning  the  multiplication  of  probabilities  can,   as  all  proba- 
bilities are  proper  fractions,  be  deduced  from  the  definition  of  the  multiplication  of  frac- 
tions, according  to  which  the  product  is  the  same  proportional  of  the  multiplicand  as  the 
multiplier  is  of  unity.    Only  as  probabilities  presuppose  infinite  numbers  of  trials^  we  shall 
commence  by  proving  the  corresponding  proposition  for  relative  frequencies. 

If,  in  p  =  Pipz,  pv  is  a  relative  frequency,  it  must  relate  to  a  trial  Tl  which, 
repeated  N  times,  has  given  favourable  events  in  Npt  cases;  and  if  p^,  being  also  a  relative 
frequency,  takes  the  place  of  multiplier,  then  the  corresponding  trial  T2,  if  repeated  Npv 
times,  must  have  given  (Np^p^  favourable  events.  Now  in  the  multiplication  p  =  pip.z, 
p  must  be  the  relative  frequency  of  the  compound  trials  which  out  of  the  total  number  of 
N  repetitions  have  given  Np^p^  favourable  events.  The  trials  Tt  and  T2  must  both  have 
succeeded  as  conditional  for  the  final  event.  As  the  number  N  can  be  taken  as  large  as 
we  please,  the  same  proposition  must  hold  good  for  probabilities. 
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The  probability  p  =/>,/>«,  as  the  product  of  the  probabilities  pv  and  pt,  relates  to 
the  event  of  a  compound  trial,  which  is  favourable  only  if  both  conditional  trials,  7',  and 
7',,  have  given  favourable  events:  first  the  trial  2\  must  have  had  the  event  whose  pro- 
bability is  />,,  and  then  the  other  trial  T2  must  have  succeeded  in  the  event,  whose 
probability,  on  condition  of  success  in  7",,  is  p%.  However  indifferent  the  order  of 
the  factors  may  be  in  the  numerical  computation  it  is  nevertheless,  if  a  probability  is 
correctly  to  be  found  as  the  product  of  the  probabilities  of  conditional  events,  necessary 
to  imagine  the  conditional  trials  arranged  in  a  definite  order.  To  prove  this  very  important 
proposition  we  shall  suppose  that  both  conditional  trials  are  carried  out  in  every  case  of 
the  compound  trial.  Let  both  Tl  and  Tz  have  succeeded  in  a  cases,  while  only  T,  has 
succeeded  in  b  cases,  only  T2  in  c  cases,  and  neither  in  d  cases.  Considering  each  of  the 

two    trials    without   any   regard    to    the   other,    we  therefore   get   -  —  j  =  P.  and 

-- 


a  . 

.  7_       ,  =  Py  as  the  frequencies  or  probabilities   of  their  favourable  events.    But  in 

the  multiplication  for  computation  of  the  compound  probability,  P1  and  P2  are  applicable 

only  as    multiplicands;  the  correct  result  p  =  —  ^-.-^  —  r—j  is  found   by    »  =  P.  —  ~. 

a  -\-b-\-c-\-d  a  +  6 

or  by  p  =  P2  •  —   —  ,  according  to   the  order  in  which  the  trials  are  executed,   but  not 

C*  ~~j     C 

as  p  =  P1PZ,  unless  a:b  =  c:d.  But  this  proportion  expresses  that  the  frequency  or 
probability  of  the  trial  Tz  is  not  affected  by  the  event  of  the  trial  7\.  This  proportionality 
is  the  mark  of  freedom,  if  we  consider  the  multiplication  of  probabilities  as  the  determination 
of  the  law  of  errors  for  a  function  of  two  observed  values  whose  laws  of  errors  are  given. 

Since  impossibility  is  indicated  by  probability  =  0,  we  see  that  the  compound 
trial  is  impossible,  if  there  is  any  of  the  conditional  trials  that  cannot  possibly  succeed, 
i.  e.  if  pv  =  0  or  p2  =  0  in  p  =  Pipz.  The  condition  of  certainty  (probability  =  1)  in 
a  compound  trial  is  certainty  for  the  favourable  events  of  all  conditional  trials;  for  as  pl 
and  pz  as  probabilities  must  be  proper  fractions,  pvpz  =  p  =  l  will  be  possible  only  when 
both  pl  =  1  and  p2  =  1. 

Example  1.  When  the  favourable  events  of  all  the  conditional  trials,  n  in 
number,  have  the  same  probability  p,  the  compound  event,  which  depends  on  the  success 
of  all  these,  has  the  probability  pn.  If  by  every  single  drawing  there  is  the  probability  of 
£  for  "red"  and  $  for  "black",  the  probability  of  10  drawings  all  giving  red  will  be 


Example  2.  Suppose  a  pack  of  52  cards  to  be  so  well  shuffled  that  the  probabi- 
lities of  red  and  black  may  constantly  be  proportional  to  the  remainder  in  the  stock,  then 
the  probability  of  the  10  uppermost  cards  being  red  will  be 

26252423222120191817         Igg  1  42  [10         &6(10)  19  1 

=  52'5r50'49*48'47'46*45'44*43  =  =   [52  |16  |10   "=  /?62(10)  ==  56588   =  =  2978' 
the  /9«(#)  being  binomial  functions. 

16 
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Example  3.  Compute  the  probability  that  a  man  whose  age  is  a  will  be  still  alive 
after  n  years,  and  that  he  will  die  in  one  of  the  succeeding  m  years. 

If  we  suppose  that  q.  is  the  probability  that  a  man  whose  age  is  i  will  die  before 
his  next  birthday,  the  probability  that  the  man  whose  age  is  a  will  be  alive  at  the  end 
of  n  years  will  be 

r>  t-\  \  / 1  \  /  -t  \ 

Pn  =  (1—  qa)  (1 —  qa+i) ....  (1—  qa+n-i)- 

The  probability  Qm  of  his  then  dying  in  either  one  or  the  other  of  the  succeeding 
m  years  will  be 

Qm    =    qa+n  +  (1—  <Ia+n)  \<}a+n+i  +  (1—  ?a+n+l)  [%a+n+2  +   •  •  •    +  (1~  <?a-|-n+m-2)  qa+n+m-i]}  ', 

or 

1 Q     —  n Q      \  n Q          \         n Q  \ 

The  required  probability  of  death  after  n  years,  but  before  the  elapse  of  n-\-m  years,  is 
consequently  PnQm  =  Pn  —  Pn+m. 

The  most  convenient  form  for  statements  of  mortality  is  not,  as  we  here  supposed, 
a  table  of  the  probabilities  q.  for  all  integral  ages  «,  but  of  the  absolute  frequencies  Z,  of 
the  men  from  a  large  (properly  infinitely  large)  population  who  will  reach  the  age  of  i. 
After  this  q.  =  -  — ~^ ,  ( 1  —  <?,  =  -^ )  will  only  be  a  special  case  of  the  general 

H  \  l*i    I 

answer : 

la 

Example  4.  We  imagine  a  game  of  cards  arranged  in  such  a  way  that  each 
player,  in  a  certain  order,  gets  two  cards  of  the  well-shuffled  pack,  and  wins  or  loses 
according  as  the  sum  of  the  points  on  his  two  cards  is  eleven  or  not.  For  5  players  we 
use,  for  instance,  only  the  cards '1,  2,  3,  4,  5,  6,  7,  8,  9,  and  10  of  the  same  colour. 

What  then  is  the  probability  of  h  players  (named  beforehand)  getting  11  and  not 
ariy  of  the  5  —  h  others? 

Secondly,  what  probability,  rt,  is  there  that  the  fcth  player  in  succession  will  be 
the  first  who  gets  11? 

Lastly,  what  is  the  probability,  <?,  that  none  of  the  players  will  get  11? 

It  will  be  found  perhaps  that  it  is  not  quite  easy  to  compute  these  probabilities 
directly.  In  such  cases  it  is  a  good  plan  to  reconnoitre  the  problem  by  first  bringing  out 
such  results  as  present  themselves  quite  easily  and  simply,  without  considering  whether 
they  are  just  those  we  require.  In  this  case,  for  instance,  we  take  the  probabilities,  £>.,  that 
each  of  the  first  i  players  will  get  11. 

We  then  attack  the  problem  more  seriously,  and  examine  if  there  are  not  any 
simple  functions  of  the  probabilities  we  have  found,  p.,  which  may  be  interpreted  as  pro- 
babilities of  the  same  or  similar  sort  as  those  inquired  after. 


t;  <>8.  Kepetitions  of  the  same  trial  occur  very  frequently  in  problems  solvable  by 
UK-  theory  of  probabilities,  and  should  always  by  treated  by  means  of  a  very  simple  and 
important  law,  the  polynomial  formula. 

Let  us  suppose  that  the  various  events  of  the  single  trial  may  be  indicated  by  colours, 
and  that,  in  the  single  trial,  the  probability  of  white  is  w,  of  black  6,  and  of  red  r. 

The  probability  that  we  shall  get  in  x -\-y-\-z  trials  x  white,  y  black,  and  z  red 
results,  in  a  given  order,  is  then 

wx  •  by  •  r*. 

The  number  of  the  events  of  this  kind  that  differ  only  in  order,  is  the  trinomial 
coefficient 


1-2  ...x.l<2  ...yl-2  ...z  ' 

which  is  the  coefficient  of  the  term  w*  •&••?*  in  the  development  of  (w  -\-  b  -f-r)  (T+|r+*>. 
And  this  same  term 

r  (r    ti    ?\  wx  .  hv  .  r*  C\  9fi\ 

T  \'bi    y,    <&)  tV         ™       '  \ji£\>j 

is  the  required  probability  of  getting  white  x  times,  black  y  times,  and  red  z  times  by 
{x -\-y-\-z}  repetitions. 

When  the  probabilities  of  all  possible  single  results  are  known  and  employed,  so 
that  w-\-b-\-r-\-  . . .  =1,  and  when  the  number  of  repetitions  is  w,  we  must  consequently 
imagine  (w-\-b-\-r-\-  ...)"  developed  by  the  polynomial  theorem,  and  the  single  terms  of 
the  development  will  then  give  us  the  probabilities  of  the  different  possible  events  of  the 
repetitions  without  regard  to  the  order  of  succession. 

Example  1.  If  the  question  is  of  the  probability  of  getting,  in  10  trials  in  which 
there  are  the  three  possible  events  of  white,  black,  and  red,  even  numbers  x,  y,  and  z  of 
each  colour,  and  if  the  probabilities  of  the  single  events  are  w,  b,  and  r,  respectively,  then 
we  must  retain  the  terms  of  (w -\- b -\- r)10  which  have  even  indices,  and  we  thus  find: 

2)  -f  210w6  (64-f  66V2+r4)  -f  210ttf4  (&6-f-15&4r24-1562r4-f  r6)  -f 

>4r4_|_286V6-{-r8)  -f  610-f  45&8r2-|-21066r4-f  21064r6-|-4562r8-|-  r10  = 

[_26)lo+(l— 2r)10}. 

The  probability,  consequently,  is  always  greater  than  {,  but  only  a  little  greater,  unless 
the  probability  of  getting  some  of  the  events  in  a  single  trial,  is  very  small. 

Example  2.  Peter  and  Paul  play  at  heads-or-tails  (i.  e.  probability  =  A  for  and 
against).  But  Peter  throws  with  3  coins,  Paul  only  with  2,  and  the  one  wins  who  gets 

16* 
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the  greatest  number  of  "heads".  If  both  get  the  same  number  of  heads  they  throw  again, 
as  often  as  may  be  necessary.  What  is  the  probability  that  Peter  will  win? 

If  we  write  for  Peter's  probability  for  and  against  throwing  heads  pl  —  \  and 
?i  ^  ib  f°r  Paul's  Pz  ==f  and  q2  =  \ ,  then  we  should  develop  "(Pi  +  SiF'CPa  +  fo)2'  and 
the  terms  in  which  the  index  of  pl  is  greater  than  that  of  pz,  are  in  favour  of  Peter; 
those  in  which  the  indices  are  equal,  give  a  drawn  game;  and  those  in  which  the  index 
of  pz  is  greater  than  that  of  p^  are  in  favour  of  Paul.  For  the  single  game  there  is  the 
probability 

for  Peter  of  A> 

1  6 

for  a  drawn  game  of  ^, 
for  Paul  of  ^. 

1  D 

As  the  probabilities  are  distributed  in  the  same  way,  when  they  play  the  games  over 
again,  we  need  not  consider  the  possibilities  of  drawn  games  at  all,  and  we  find  ^  as 
Peter's  final  probability. 

Example  3.  A  game  which  is  won  once  out  of  four  times,  is  repeated  10  times. 
What  is  the  probability  of  winning  at  most  2  of  these? 

551124 

1048576  ' 

§  69.  It  often  occurs  that  we  inquire  in  a  general  way  concerning  a  probability, 
which  is  a  function  of  one  or  more  numbers.  Often  it  is  also  easier  to  transform  a  special 
problem  into  such  a  one  of  a  more  general  character,  where  the  unknown  is  a  whole  table 
Pv  pz,  ps,  ...  pn  of  probabilities,  the  suffixes  being  the  arguments  of  the  table.  And 
then  we  must  generally  work  with  implicit  equations,  f(p±,  •  •  •  pn)  =  0,  particularly  such 
as  hold  good  for  an  arbitrary  value  of  n,  i.  e.  with  difference-equations.  Integration  of 
finite  difference-equations  is  indeed  of  so  great  importance  in  the  art  of  solving  problems 
of  the  theory  of  probabilities,  that  we  can  almost  understand  that  Laplace  has  treated 
this  method  almost  as  the  one  to  be  used  in  all  cases,  in  fact  as  the  scientific  quintessence 
of  the  theory  of  probabilities. 

Since  finite  difference-equations  like  differential  equations  cannot  as  a  rule  be  inte- 
grated by  known  functions,  we  can  in  an  elementary  treatise  deal  only  with  the  simplest 
cases,  especially  such  as  can  be  solved  by  exponential  functions,  namely  the  linear  difference- 
equations  with  constant  coefficients.  As  to  these,  it  is  only  necessary  to  mention  here 
that,  when 

cmpn+m  +  . . .  +  cQpn  =  0    (n  being  arbitrary), 

the  solution  is  given  by 

pn  =  far"  -f-  .  .  .  -f  kmrm  ,  (127) 
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w lii-re  /'i,  ...  /•„,  are  the  roots  in  Ihr  ei|iialion 

Cm?™  -j-   •  •  •    -f  <*0     =0, 

while  /,,,  ...  &m  are  integration-constants  whenever  the  corresponding  roots  occur  singly; 
but  rational  integral  functions  with  arbitrary  constants,  and  of  the  degree  i  —  1 ,  if  the 
corresponding  root  occurs  i  times. 

I  shall  mention  one  other  means,  however,  not  only  because  it  can  really  lead 
to  the  integration  of  many  of  the  difference-equations  which  the  theory  of  probabilities 
leads  to,  particularly  those  in  which  the  exponential  functions  occur  in  connection  with 
binomial  functions  and  factorials,  but  also  because  it  has  played  an  important  part  in  the 
conception  of  this  book. 

The  late  Professor  L.  Oppermann,  in  April  1871,  communicated  to  me  a  method 
of  transformation,  which  I  shall  here  state  with  an  unessential  alteration. 

A  finite  or  infinite  series  of  numbers 


can  univocally  be  expressed  by  another: 


w2  -f    M3  -f    u4 


w    =         -  u 


6^4  -f-  •  •  • 


(128) 


where  the  sum  2'  may  be  taken  from  — oo  to  -foo,  provided  that  up  =  0  when  p>.n. 
In  order,  vice  versa,  to  compute  the  w's  by  means  of  the  w\  we  have  equations  of  just 
the  same  form: 


(129) 


Here,  as  in  (17)  and  (18),  the  general  dependency  between  the  M,  and  Wj  can  be 
expressed  in  a  single  equation,  be  means  of  an  independent  variable  z.  From  (129)  we 
get  identically 

If  we  here  put  1  —  tf  =  ev,  then  1— &  =  &  will  reduce  (128)  to  an  equation  of 
the  same  form. 
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If  Ui  is  the  frequency  or  probability  of  %  taken  as  an  observed  value,  then  also 


illustrate  the  relations  of  the  values  in  Oppermann's  transformation  to  the  half-invariants 
and  sums  of  powers.    In  particular  we  have 


w 


^  __ 
= 


3 
«3  =  —  o  —      f-b 


w, 


If  now  MO,MX,  ...  MB  are  a  series  of  probabilities  or  other  quantities  which  depend 
on  their  suffix  according  to  a  fixed  law,  and  if  we  know  this  law  only  through  a  difference- 
equation,  then  Oppermann's  transformation  of  course  leads  only  to  a  difference-equation 
for  w?0,  Wj,  ...  wn  as  function  of  their  suffix.  But  it  turns  out  that,  in  problems  of 
probabilities,  this  equation  pretty  often  is  easier  to  deal  with  than  the  original  one  (for 
instance  the  more  difficult  ones  in  Laplace's  collection  of  problems).  If  we  can  look  upon 
a  probability  w,  as  the  functional  law  of  errors  for  i  as  the  observed  value,  then  w  expresses 
the  same  law  of  errors  by  symmetrical  functions,  and  frequently  we  want  nothing  more. 
If  we  have  to  reverse  the  process  to  find  Ui  itself,  the  series  are  pretty  simple  if  w  is 
simple;  but  they  are  often  less  favourable  for  numerical  computation,  as  they  frequently 
give  the  unknown  as  a  difference  between  much  larger  quantities.  There  exists  a  means 
of  remedying  this,  but  it  would  carry  us  too  far  to  enter  into  a  closer  examination  of  the 
question  here. 

Example  1.  I  throw  a  die,  and  go  on  throwing  till  I  either  win  by  getting  "one" 
twice,  or  lose  by  throwing  "two"  or  "three".  If  the  game  is  to  be  over  at  latest  by  the 
wth  throw,  what  is  my  probability  of  winning?  If  the  number  of  throws  is  unlimited, 
what  is  the  probability  of  another  "one"  appearing  before  any  "two"  or  "three"? 

Four  results  are  to  be  distinguished  from  one  another.  At  any  throw,  say  the  ith, 
the  game  can  in  general  be  won,  lost,  half  won  (by  only  one  "one"),  or  drawn.  Let  the 
probability  of  the  *th  throw  resulting  in  a  win  be  p.,  of  the  same  resulting  in  a  loss  be  q., 
in  half  win  st,  and  in  a  drawn  game  be  r,-,  then  pt  =0,  ql  =  |,  st  =  £,  and  rv  =  1. 
Thus  the  probability  of  a  second  throw  is  f,  and,  generally,  the  probability  of  an  *th  throw 
s._M  +rj._1.  It  is  easy  to  express  p.^  q{,  /v,  and  st.  in  terms  of  ri_i  and  st._t,  and  also 


1L>7 

Pi-.p  7<_i'  *Y_i'  an<*  *,_!  in  terras  of  rf_f  and  st  _,,  etc.  By  elimination  then  the  difference- 
equations  can  be  found. 

When  we  replace  p  or  </  or  s  or  r  by  #  the  difference-equation  can  be  written  in 
llic  common  form 

Xi  —  Xi-i   -\-  ]#i-2    =    0, 

which  is  integrated  as 

xt  =  (a  +  bi)  2-<; 

for  r  we  have  the  simpler  form 

r,  =  |r<_i. 

When,  by  the  probabilities  of  the  first  throws,  we  have  determined   the  constants, 
we  get 

-  *'"""*  2-' 

Pi  -       9 


and 

r,  =  2-'. 

We  then  have  the  formulae  Pn  =  PI  -f-  .  .  .  -\-pn  and  Qn  =  ^1+  •  •  •  -f-  2n,  for  the 
probabilities  of  making  the  winning  or  losing  throw,  and  we  get 

Pn  1  .    (2»-l)-n  ^<*  1 

P«+^»~      3     3(2"-!)  —  n  Poo+^>oo~      9* 

Example  2.  In  a  game  the  probability  of  winning  is  tt.  The  same  game  is 
repeated  a  great  many,  w,  times.  If  it  then  happens  at  least  once  in  this  series  that  m 
successive  games  are  won,  you  get  a  prize.  What  is  the  probability  pn  of  this?  In  a 
game  of  dice,  where  TS  =  £  ,  what  is  the  probability  of  getting  a  series  of  5  "sixes"  in 
10000  throws? 

It  will  be  simplest  to  find  the  probability,  qr=  1  —  jpr,  that  the  prize  will  not  be 
got  in  the  first  r  repetitions.  The  difference-equation  for  this  is 

qr+m+i  -  Vr+m  +  (1  -  «)  *>m<lr  =  0  (a) 

or 

m_t+       4-®'»-2?r1  +  e3'»-19r    ==  0,    (b) 


where  (b)  is  the  first  integral  of  (a).     (As  well  as  (a)  we  can  directly  demonstrate  (b). 
How  ?).     Hence 

9r  ==  citf+---  +<w>;< 

where  ct,  .  .  .  cm  are  constants,  which  as  well  as  c0  =  0  must  be  determined  by  means  of 
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qQ  =  q^  =  ...==  qm_i  =  1,  ?m  =••  I  —  ®"1,  and  pi  to  ^M  are  the  roots  of  an  irreducible 
equation  of  the  mth  degree,  which  is  got  from 

pm+l  _  pm    ==    fljm+1  _  ^jm  (c) 

by  dividing  out  p  —  73.  The  largest  of  these  roots  (for  small  ra's  or  large  m's)  will  be 
only  a  little  less  than  1;  a  small  negative  root  occurs  when  m  is  even;  the  others  are 
always  imaginary,  and  they  are  also  small. 

In  the  actual  computation  it  is  highly  desirable  to  avoid  the  complete  solution  of  (c). 
This  can  be  done,  and  this  problem  will  illustrate  a  most  important  artifice.  We  may 
use  the  difference-equation  to  compute  a  single  value  of  the  unknown  function  by  means 
of  those  which  are  known  to  us  from  the  conditions  of  the  problem,  and  then  successive 
values  of  the  unknown  function  by  means  of  those  already  obtained;  here,  for  instance, 
(b)  enables  us  to  get  qm+i  in  terms  of  q^  . . .  qm.  Then  we  get  qm+z,  either  by  again  ap- 
plying (b)  to  ^3,  ...  qm+i  or  by  applying  (a)  to  qi  and  qm+i  (or  best  in  both  ways  for  the 
sake  of  the  check),  etc. 

It  is  evident  that  the  table  of  the  numerical  values  of  the  function  which  we  can 
form  in  this  way,  cannot  easily  become  of  any  great  extent  or  give  us  exact  information 
as  to  the  form  of  the  function.  But  we  are  able  to  interpolate,  and,  when  the  general 
form  of  the  function  is  known  (as  here),  we  may  be  justified  in  using  extrapolation  also. 
In  our  example  we  need  only  continue  the  computations  above  described  until  the 
term  in  qr  =  cipri-\-  . . .,  corresponding  to  the  greatest  root  p^  dominates  the  others  to 
such  a  degree  that  the  first  difference  of  Log  qr  becomes  constant,  and  the  computation  of 
qr  for  higher  indices  can  then  be  made  as  by  a  simple  geometrical  progression.  In  the 
numerical  case  qr  =  1-004078  x  (0-9998928)r ;  1  —  qmw  =  0-6577. 

Example  3.  A  bag  contains  n  balls,  a  white  and  a  —  n  black  ones.  A  ball  is 
drawn  out  of  the  bag  and  a  black  ball  then  placed  in  it,  and  this  process  is  repeated  y 
times.  After  the  iyth  operation  the  white  and  black  balls  in  the  bag  are  counted.  Find 
the  probability  ux(y)  that  the  numbers  of  white  balls  will  then  be  x  and  the  black 
ones  n  —  x. 

We  have 

u*(y}  =  -^My-l)+3     -ux+i(y-l) 

r(f  tv 

and 

ux(Q)  =  0,    except  ua(Q)  =  1. 

By  Oppermann's  transformation  we  find 

My)  =  (-l)'2fa(z)  •  ux(y] , 
2  taken  from  x=  -co   to  x=  -|-QO,  or 

•My)  =  (-\Y2^M^'^(y-^^(-^2^^-^i(z}'Ux+i(y-\). 
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The  limits  ni'  x  under  2'  being  infinite,  ar-f-1  can  be  replaced  by  x,  consequently 

w*(y)  —  -£-«>•(?—  1)« 

This  difference-equation,  in  which  y  is  the  variable,  may  easily  be  integrated.  As  we  have,  further, 

•MO)  -  (-1)'/M*)< 
we  get 

Wt(y)    =    (-!)•.£,(« 


By  Oppermann's  inverse  transformation  we  find  now: 

My)  --  (-i)T^(^)-(-i)^^( 

21  taken  from  x  =  —  oo   to  x  =  -\-<x>.    This  expression 


tins  the  above  mentioned  practical  short-comings,  which  are  sensible  particularly  if  n,  a  —  x, 
or  y  are  large  numbers;  in  these  cases  an  artifice  like  that  used  by  Laplace  (problem  17) 
becomes  necessary.  But  our  exact  solution  has  a  simple  interpretation.  The  sum  that  mul- 

tiplies fia  (x)  in  ux(y],  is  the  (a—  #)th  difference  of  the  function  (--—  j  ,  and  is  found  by  a  table 

c  ,,  In  —  a\v     In  —  a  -\-\\v  In—x  —  l\v    ln  —  x\y         a,     a.         !•«• 

of  the  values  ,  —  I  ,  ...   I  -  ,         —   ,  as  the  hnal  difference  formed  bv 

\nl\n  \nl\nl  J 

all  these  consecutive  values.  We  learn  from  this  interpretation  that  it  is  possible,  if  not 
easy,  to  solve  this  problem  without  the  integration  of  any  difference-equation,  in  a  way 
analogous  to  that  used  in  §  67,  example  4. 

If  we  make  use  of  wz(y)  to  give  us  the  half-invariants  ^x,  /^2  for   the  same  law 
of  errors  as  is  expressed  by  nx(y),  then  we  find  for  the  mean  value  of  x  after  y  drawings 


and  for  the  square  of  the  mean  error 


XVI.   THE  DETERMINATION  OF  PROBABILITIES  A  PRIORI 

AND  A  POSTERIORI. 

§  70.  The  computations  of  probabilities  with  which  we  have  been  dealing  in  the 
foregoing  chapters  have  this  point  in  common  that  we  always  assume  one  or  several  pro- 
babilities to  be  given,  and  then  deduce  from  them  the  required  ones.  If  now  we  ask,  how 

17 
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we  obtain  those  "given"  probabilities,  it  is  evident  that  other  means  are  necessary  than 
those  which  we  have  hitherto  been  able  to  mention,  and  provisionally  it  must  be  clear 
that  both  theory  and  experience  must  cooperate  in  these  original  determinations  of  proba- 
bilities. Without  experience  it  is  impossible  to  insure  agreement  with  reality,  and  without 
theory  in  these  as  well  as  in  other  determinations  we  cannot  get  any  firmness  or  exactness. 
In  determining  probabilities,  however,  there  is  special  reason  to  distinguish  between  two 
methods,  one  of  which,  the  a  priori  method  seems  at  first  sight  to  be  purely  theoretical, 
while  the  other,  the  a  posteriori  method,  is  as  purely  empirical. 

§  71.  The  a  priori  determination  of  probabilities  is  based  on  estimate  of  equality, 
inequality,  or  ratio  of  the  probabilities  of  the  several  events,  and  in  this  process  we  always 
assume  the  operative  causes,  or  at  any  rate  their  mode  of  operation,  to  be  more  or  less 
known. 

On  the  one  hand  we  have  the  typical  cases   in  which  we  know  nothing  else  with 
respect  to  the  events  but  that  each  of  them  is  possible,  and  in  the  absence  of  any  reason 
for  preferring  any  one  of  them   to  any  other,  we  estimate  them  to  be  equally  probable  - 
though  certainly  with  the  utmost   uncertainty.     For  instance:   What  is  the  probability   of 
seeing,  in  the  course  of  time,  the  back  of  the  moon?    Shall  we  say  |  or  §? 

On  the  other  hand  we  have  the  cases  —  equally  typical,  but  far  more  important  - 
in  which,  by  virtue  of  a  good  theory,  we  know  so  much  of  the  causes  or  combinations  of 
causes  at  work  that,  for  each  of  those  which  will  produce  one  event,  we  can  point  out 
another  (or  n  others)  which  will  produce  the  opposite  event,  and  which  according  to  the 
theory  must  occur  as  frequently.  In  this  case  we  must  estimate  the  probability  of  the 
result  at  -=-  and  —^  respectively,  and  if  the  conditions  stated  be  strictly  fulfilled,  such  a 

&  %  — T-  i 

determination  of  probability  will  be  exact. 

But  even  if  such  a  theory  is  not  absolutely  unimpeachable,  we  can  often  in  this 
way  obtain  probabilities,  which  are  so  nearly  exact  and  have  such  infinitely  small  mean 
errors,  that  we  may  very  well  make  use  of  them,  and  compute  from  them  values  which 
may  be  used  as  our  theoretically  given  probabilities.  We  are  not  more  strict  in  other 
kinds  of  computations.  In  astronomical  adjustment,  for  instance,  it  is  almost  an  established 
practice  to  consider  all  times  of  observation  as  theoretically  given.  Their  real  errors,  how- 
ever, will  often  give  occasion  to  sensible  bonds  between  the  observed  co-ordinates;  but  the 
fact  is  that  it  would  require  great  labour  to  avoid  the  drawback. 

Such  an  a  priori  determination  of  probabilities  is  particularly  applicable  in  games. 
For  it  is  essential  to  the  idea  of  a  game  that  the  rules  must  be  laid  down  in  such  a 
way  that,  on  the  one  hand  they  exclude  all  computation  beforehand  of  the  result  in  a 
particular  case,  while  on  the  other  hand  they  make  a  pretty  exact  computation  of  the  pro- 
babilities possible.  The  procedure  employed  in  a  game,  e.  g.  throwing  of  dice  or  shuffling 
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of  cuds,  ought  therefore  to  exclude  all  circumstances  that  might  permit  the  players  to  set 
causes  in  train,  which  could  bring  about  or  further  a  certain  event  (corriger  la  fortune). 
Hut  also  those  circumstances  ought  to  be  eliminated,  which  not  only  by  their  incalculability 
make  a  judgment  of  the  probabilities  very  insecure,  but,  above  all,  make  it  depend  on  the 
theoretical  insight  of  tho  parties.  Otherwise  the  game  will  cease  to  be  a  fair  game  and 
will  become  a  struggle.  The  so-called  stock-jobbing  is  rather  a  war  than  a  game. 

When  the  estimate  of  the  probabilities  depends  essentially  upon  personal  knowledge, 
we  speak  of  a  subjective  probability.  This  too  plays  a  great  part,  especially  in  daily  life. 
The  fear  which  ignorant  people  have  of  all  that  is  new  and  unknown,  proves  that  they 
understand  that  there  is  a  great  uncertainty  in  the  estimate,  and  that  it  is  greater  for 
those  who  know  but  a  little,  than  for  those  who  know  more  and  are  therefore  better  able 
to  judge. 

Roulette  may  be  taken  as  an  example  of  the  objective  probability  which  arises  in 

a  well  arranged  game.     A  pointer  turns  on  an  almost  frictionless  pivot  and  points  to  the 

scale  of  a    circle  whose  center  is   in  the  pivot.    The  pointer  is  made  to  revolve  quickly, 

and  the  result  of  the  game  depends  on  where  it  stops.   If  the  pointer  stops  opposite  a  space 

-  suppose  a  red  one  -  -  previously  selected  as  favourable,  the  game  is  won. 

There  we  have  as  essential  circumstances:  1)  the  length  of  the  arc  which  is  tra- 
versed, this  being  determined  by  the  initial  velocity  and  the  friction,  2)  the  initial  position, 
and  3)  the  manner  in  which  the  circle  is  divided. 

The  length  of  the  arc  is  unknown,  especially  when  we  take  care  to  exclude  very 
small  velocities,  and  when  the  friction,  as  already  mentioned,  is  very  slight.  So  much 
only  may  be  regarded  as  given,  that  the  frequency  of  a  given  length  of  the  arc  must,  as 
function  of  this  length,  be  expressed  by  a  functional  law  of  errors  of  a  nearly  typical  form. 
For  the  frequency  must  go  down,  asymptotically,  as  far  as  0,  both  below  and  above  limits 
of  the  arc  which  will  be  separated  by  many  full  revolutions  of  the  pointer,  and  with  at 
least  one  maximum  between  these  limits.  If  now,  for  instance,  it  depended  on,  whether 
the  arc  traversed  was  greater  or  smaller  than  a  certain  value,  the  apparatus  would  be  in- 
expedient, it  would  not  allow  any  tolerably  trustworthy  a  priori  estimate.  But  if  the 
winning  space  (or  spaces)  is  small  in  proportion  to  the  total  circumference  and,  moreover, 
repeated  regularly  for  each  of  the  numerous  revolutions,  then  the  a  priori  determination 
of  the  probabilities  will  be  even  very  exact.  For  an  area  ABab,  bounded  by  any  finite, 
continuous  curve  whatever  (in  the  present  case  the  curve  of  j$ 

errors  of  the  different  possible  events),  by  the  axis  of  abscissae, 
and  two  ordinates,  can  always  as  a  first  approximation  be 
expressed  as  the  sum  of  numerous  equidistant  small  areas 

pP,  qQ'  ....   with   a   constant   base,   multiplied  by  the       &pp'       <}<]'       rr'       *s' 

17* 
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interval  pq  =  qr  =  ...  and  divided  by  the  base  pp'  —  qq'  ==  ...  And  if  we  speak  of 
the  total  area  of  a  curve  of  errors,  then  the  series  of  which  the  first  term  is  this  approx- 
imation, is  even  very  convergent,  in  such  a  degree  as  6(x)  =  1-}-  #+  #4  +  #9+  #16+  . .  • 
for  small  #,  and  the  said  approximation  is  sufficient  for  all  practical  purposes. 

That  the  initial  position  of  the  roulette  is  unknown,  does  not  essentially  change 
the  result  of  the  foregoing,  viz.  that  the  probability  of  winning  is  — .  This  uncertainty 
can  only  cause  an  improvement  of  the  accuracy  of  this  approximation.  If  we  may  assume 
that  the  pointer  will  as  probably  start  from  any  point  in  the  circle  as  from  any  other,  this 
determination  — ^  will  even  be  exact,  without  any  regard  to  the  special  kind  of  the  un- 
known function  of  frequency. 

The  ratio  of  the  winning  space  on  the  circle  pp'  to  the  whole  circumference  pq, 
the  third  essential  circumstance,  cannot  be  determined  wholly  a  priori,  but  demands  a 
measurement  or  a  counting  whose  mean  error  it  is  essential  to  know. 

The  a  priori  determination  of  probability  can  thus,  according  to  circumstances, 
give  results  of  the  most  different  values,  from  the  very  poorest  through  gradual  transition 
up  to  such  exact  probabilities  as  agree  with  the  suppositions  in  §  65  seqq.,  and  permit  the 
probability  to  replace  the  whole  law  of  errors  for  our  predictions.  But  what  the  a  priori 
method  cannot  give,  is  a  quantitative  statement  of  the  uncertainty  which  affects  the 
numerical  value  of  the  probability  itself.  Only  when  it  is  evident,  as  in  the  example  of 
the  roulette,  that  this  uncertainty  is  infinitely  small,  can  we  make  use  of  a  priori  proba- 
bilities in  computations  that  are  to  be  relied  on.  If  in  the  work  and  struggles  of  our  life, 
we  cannot  entirely  avoid  building  on  altogeth  r  uncertain  and  subjective  a  priori  estimates, 
great  caution  is  necessary,  and  in  order  not  to  overdo  this  caution  for  want  of  a  proper 
measure,  we  must  try,  by  tact  or  experience,  without  any  real  method,  to  get  an  estimate 
of  the  uncertainty. 

Even  by  the  best  a  priori  determinations  of  probability  caution  is  not  superfluous; 
the  dice  may  be  false,  the  pivot  of  the  roulette  may  be  worn  out  or  bent,  and  so  on. 

§  72.  By  the  a  posteriori  determination  of  probabitity  we  build  on  the  law  of  the 
large  numbers,  inferring  from  a  law  of  actual  errors  in  the  form  of  frequency  to  the  law 
of  presumptive  errors  in  that  of  the  probability.  We  repeat  the  trial  or  the  observation, 
and  count  the  numbers  m  for  the  favourable  and  n  for  the  unfavourable  events. 

Owing  to  the  signification  of  a  probability  as  mean  value,  the  single  values  being 
0  for  every  unfavourable  event,  1  for  every  favourable  event,  the  probability  p  for  the  fa- 
vourable event  must  be  transferred  unchanged  from  the  law  of  actual  errors  to  that  of 
presumptive  errors;  consequently 

*  =  -!?-•  (130) 

m  --  n 
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Since,  according  to  the  same  consideration,   the  square  of  the  mean  deviation  loi 

S    8    -  •-  -  *J  ^  l/i'/i 

trial  is  -  .,  ,  ;ind  the  number  S 

•v  (w-r-tt)2 

I  In-  square  of  the  mean  errors  must,  according  to  (47),  be 


S    8    -  •-  -  *J  ^  l/i'/i 

a  single  trial  is  -  .,  ,  ;ind  the  number  SQ  of  the  repetitions  is  «=  w-fn, 

•v  w--tt2 


(m-i  //) 

which  is,  therefore,  the  square  of  the  mean  error  for  a  single  trial,  whether  this  is  one  of 
those  which  we  have  made,  or  is  a  repetition  which  we  are  still  to  make,  and  for  which 
we  are  to  compute  the  uncertainty. 

If  we  then   ask  for  the  mean  error  of  the  probability  p  =-  -      —  ,   got  from  the 

m  -f-  n 

m  -\-  n  repetitions,  we  have 

I  (V\  -  ~  mn  -  =  -^l1—  l>)  (132) 

(w  +  n)*(ro  +  n  —  1)        w  +  w-  1 

as  the  square  of  this  mean  error. 
The  identity 

mn        .  mn  mn 

r\2  "T 


(w-f-w)2   '   (m-\-n)*(m  +  n  —  1)        (m  +  n)  (m  +  n  —  1) 
or 

PZ  +  lt(P)  =  *« 

shows  that  the  mean  error  at  a  single  trial,  when  the  probability  p  is  determined  a  poste- 
riori by  m  -}-  n  repetitions,  can  be  computed  by  (34),  as  originating  in  two  mutually  free 
sources  of  errors,  one  of  which  is  the  normal  uncertainty  belonging  to  the  probability,  for 
which  ^2  =  pq  (123),  while  the  other  is  the  inaccuracy  of  the  a  posteriori  determination, 
for  which  kz(p)  is  the  square  of  the  mean  error. 

The  a  posteriori  determination  therefore  never  gives  an  exact  result,  but  only  an 
approximation  to  the  probability.  Only  when  the  number  of  repetitions  we  employ  is  so 
large  that  their  reduction  by  a  unit  may  be  regarded  as  insignificant,  we  can  immediately 
employ  the  probabilities  found  by  means  of  them  as  complete  expressions  for  the  law  of 
errors.  But  even  by  the  very  smallest  number  of  repetitions  of  the  trial,  we  not  only  obtain 
some  knowledge  of  the  probability,  but  also  a  determination  of  the  mean  error,  which  may 
be  useful  in  predictions,  and  may  serve  as  a  measure  of  the  caution  that  is  necessary.  It 
must  be  admitted  that  it  is  not  such  a  simple  thing  to  employ  these  mean  errors  as  those 
in  the  ideal  theory  of  probability,  but  it  is  not  at  all  difficult. 

As  above  mentioned,  the  a  posteriori  determination  of  probability  seems  to  be  purely 
empiric;  theory,  however,  takes  part  in  it,  but  is  concealed  in  the  demand,  that  all  the 
trials  we  make  use  of  must  be  repetitions,  in  the  same  way  as  the  future  trials  whose 
results  and  uncertainty  are  predicted  by  the  a  posteriori  probabilities.  Transgressions  of 
this  rule,  which  reveal  themselves  by  unsuccessful  predictions,  are  by  no  means  rare,  and 
compel  statistics  and  the  other  sciences  which  work  with  probabilities,  to  many  alterations 
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of  their  theories  and  hypotheses,  and  to  the  division  of  the  materials  obtained  by  trial  into 
more  and  more  homogeneous  subdivisions. 

Example.  A  die  is  inaccurate  and  suspected  of  being  false.  On  trial,  however, 
we  have  on  throwing  it  126  times  got  "six"  exactly  21  times,  and  so  far,  all  is  right. 
The  probability  of  "six"  is  found,  consequently,  to  be  p  =  ~  --=  -J-;  the  square  of  the 

151  1 

mean  error  is   X^(p)  =  y'Tf'Jgj  ^  999  5  the    limits  indicated   by  the  mean  errors   are 
consequently  -^-4-  5^,  or  ~  and  ir. 

b  —  oU  15  5 

If  now  we  seek  the  probability  that   we   shall   not  get  "six"  in   6   throws,    the- 


probability  is  still  as  by  an  accurate  die  (1  —p)e  =          =  —  +  ...,  but  what  is  now  the 


mean  error?  Ideally,  its  square  should  be  (1  —  p)6  (1  —  (1  —  p)6)  =  J-_|_...  But  if  p 
can  have  a  small  error  dp,  the  consequent  error  in  (1  —  p)G  will  be  —  6(1—  p)5dp;  if 
then  the  square  of  the  mean  error  of  p  is  =  ^  ==  p  (1  —  p)—  —  -,  the  total  square  of 
the  mean  error  of  the  probability  of  not  getting  "six"  in  6  throws  will  be 


_ 
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In  every  single  game  of  this  sort  the  mean  error  is  therefore  only  slightly  larger  than  with 
an  accurate  die,  but  its  actual  value  is  so  large  (nearly  i)  as  to  call  for  so  much  caution 
on  the  part  both  of  the  player  and  of  his  opponent,  that  there  is  not  much  chance  of 
their  laying  a  wager.  This  may  be  remedied  by  stipulating  for  a  large  number  of  repeti- 
tions of  the  game.  Let  us  examine  the  conditions  if  we  are  to  play  this  game  of  making 
6  throws  without  "six"  72  times.  With  the  above  approximate  fractions  there  will  be 
expectation  of  winning  72  •  |  =  24  games.  In  the  computation  of  the  square  of  the 
mean  error  of  this  result,  the  first  term  in  the  above  ^2  must  be  multiplied  by  72,  but 
the  second  by  722;  hence 


=  16  +  33  =  49. 

The  mean  error  will  be  about  7,  while  it  would  only   have  been  4,   if  the  die  had   been 
quite  trustworthy. 

§  73.  We  have  mentioned  already,  in  §  66,  the  skewness  of  the  laws  of  errors 
which  is  peculiar  to  all  probability.  It  does  not  disappear,  of  course,  in  passing  from  the 
law  of  actual  errors  to  that  of  presumptive  errors,  and  in  the  a  posteriori  determination  of 
probability  it  produces  what  we  may  call  the  paradox  of  unanimity:  if  all  the  repetitions 
we  have  made  agree  in  giving  the  same  event,  the  probability  deduced  from  this,  a  poste- 
riori, must  not  only  be  1  or  0,  but  the  square  of  the  mean  error  J..,  (p)  of  these  determi- 
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nations  (as  well  as  the  higher  half-invariants)  becomes  —  >0.  Must  we  infer  then,  respective!)  . 
to  certainty  or  to  impossibility,  only  because  a  smaller  or  greater  number  of  repetitions 
mutually  agree?  must  we  consider  a  unanimous  agreement  as  a  proof  of  the  absolute 
correctness  of  that  which  is  thus  agreed  upon?  Of  course  not;  nor  can  this  inference  be 
iiiiiiiituiiK'd,  if  we  look  more  closely  at  the  law  of  errors  /./2  =  0,  //»  =  0,  .  .  .  /^  =  0. 

Such  a  law  of  errors,  to  be  sure,  may  signify  certainty,   but  not  when,   as  here,  the  ratio 

• 
/tn:ftl  =  oo.   A  law  of  errors  which  is  skew  in   an   infinitely  high  degree,   must  indicate 

something  peculiar,  even  though  the  mean  error  be  ever  so  small.  Add  to  this  that  it  is 
not  a  strict  consequence  in  practical  calculations  that,  because  the  square  of  a  number, 
here  that  of  the  mean  error,  is  =  0,  the  number  itself  must  be  =  0,  but  only  that  it 
must  be  so  small  that  it  may  be  treated  as  a  differential,  which  otherwise  is  indeterminate. 
The  paradox  being  thus  explained,  it  follows  that  no  objections  against  the  use  of  a 
posteriori  probabilities  in  general  can  be  based  on  it.  But  it  must  warn  us  to  be  cautious 
in  computations  with  such  probabilities  as  observed  values,  where  the  computation,  as  the 
method  of  the  least  squares,  presupposes  typical  laws  of  errors.  For  this  reason,  we  must 
for  such  computations  reject  all  unanimously  or  nearly  unanimously  determined  probabilities 
as  unsuitable  material  of  observation.  Another  thing  is  that  we  must  also  reject  the 
hypothesis  or  theory  of  the  computation,  if  it  does  not  explain  the  unanimity.  As  an 
example  we  may  take  an  examination  of  the  probability  of  marriage  at  different  ages.  The 
a  posteriori  statistics  before  the  c.  20th  year  and  after  the  c.  60th  must  not  be  used  in  the 
computation  of  the  sought  constants  of  the  formula,  but  the  formula  can  be  employed  only 
when  it  has  the  quality  of  a  functional  law  of  errors  so  that  it  approaches  asymptotically 
towards  0,  both  for  low  and  high  ages. 

The  paradox  of  unanimity  has  played  rather  a  considerable  part  in  the  history  of 
the  theory  of  probabilities.  It  has  even  been  thought  that  we  ought  to  compute  a  poste- 
riori probabilities  by  another  formula 

^"  (Bayes'8  Rule)  (134) 


and  not,  as  above,  by  the  formula  of  the  mean  number 

m 

p  =  —  —  a 

m  -{-;  n 

The  proofs  some  authors  have  tried  to  give  of  Bayes's  rule  are  open  to  serious  objections. 
In  the  "Tidsskrift  for  Mathematik"  (Copenhagen,  1879),  Mr.  Bing  has  given  a  crushing 
criticism  of  these  proofs  and  their  traditional  basis,  to  which  I  shall  refer  those  of  my 
readers  who  take  an  interest  in  the  attempts  that  have  been  made  to  deduce  the  theory 
of  probabilities  mathematically  from  certain  definitions. 
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Bayes's  rule  has  not  been  employed  in  practice  to  any  greater  extent,  particularly 
not  in  statistics,  though  this  science  works  entirely  with  a  posteriori  probability.  But  as 
it  makes  the  paradox  of  unanimity  disappear  in  a  convenient  way,  and  as,  after  all,  we 
can  neither  prove  nor  disprove  the  exact  validity  of  a  formula  for  the  determination  of 
an  a  posteriori  probability,  any  more  than  we  can  do  so  for  any  transition  whatever  from 
the  law  of  actual  errors  to  that  of  presumptive  errors,  the  rule  certainly  deserves  to  be 
tested  by  its  consequences  in  practice  before  we  give  it  up  altogether.  The  result  of  such 
a  test  will  be  that  the  hypothesis  that  Bayes's  rule  will  give  the  true  probability,  can 
never  deviate  more  than  at  most  the  amount  of  the  mean  error  from  the  result  of  the 
series  of  repetition,  viz.  that  m  events  out  of  m  -j-  n  have  proved  favourable.  In  order  to 
demonstrate  this  proposition  we  shall  consider  a  somewhat  more  general  problem. 

If  we  assume  that  trials  have  been  previously  made  which  have  given  fi  favourable, 
v  unfavourable  events,  and  that  we  have  now  in  continuing  the  trials  found  m  favourable 
and  n  unfavourable  events,  then  the  probability,  being  looked  upon  as  the  mean  value,  is 
determined  by 

p  _  -   ^±£-,  ,135) 

m  -f  n  -f-  ft  +  v 

of  which  Bayes's  formula  is  the  special  case  corresponding  to  //  =  v  =  1.  Bayes's  rule 
would  therefore  agree  with  the  general  rule,  if  we  knew  before  the  a  posteriori  determination 
so  much  of  the  probability  of  both  cases,  as  a  report  of  one  earlier  favourable  event  and 
one  unfavourable  event. 

In  the  more  general   case   the  square  of  the  mean  error  at  the  single  trial  is  now 


;u  = 


(m  -\-  ft)  (n  +  v) 


(m  -f-  n  -{-  fi.  -\-  v)  (m  -f-  n  -f-  p  -j-  v  —  1)  ' 
and  for  the  m-\-n  trials  is 

^2  (m  4~  n)  =  (m  ~h  n)  Az- 

If  we  now  compare  with   this   the  square  of  the  deviation  between  the  new  ob- 
servation and  its  computed  value,  that  is,  between  m  and  (m-\-n)p,  we  find 

(m  —  (m  -\-  n)  p)*  _  (^n  —  v  m)z  m  -\-  n  -\-  p  -\-  v  —  1 

Az(m-\-n)  (m  -j-  p.)  (n  +  i/)  (m  -j-  n)       m  -\-  n  -f  p  -(-  v 


(j.  -j-  v      m-}-n}\m-{-p      n-\-v)      m  -f-  n 

It  appears  at  once  from  the  latter  formula  that  the  greatest  imaginable  value  of  the  ratio  is 
the  greatest  of  the  two  numbers  //  and  v.  In  Bayes's  rule  p  =  v  =  1.  Here,  therefore,  1  is 
the  absolute  maximum  of  the  ratio  of  the  square  of  deviation  to  that  of  the  mean  error. 
With  respect  to  Bayes's  rule  the  postulated  proposition  is  hereby  demonstrated.  But  ^at 
the  same  time  it  will  be  seen  that  we  can  replace  Bayes's  rule  by  a  better  one,  if  there  is 


137 

only  an  a  priori  determination,  however  uncertain,  of  the  probability  we  are  seeking.  If 
we  take  the  a  priori  probabilities  &  for,  and  (1  —  ?3)  against,  instead  of  n  and  i/,  so  that 

w  -4-  BJ 

P  —  -  rr»-  (137) 

w-|-n-(-  1 

then  we  are  certain  to  avoid  the  paradox  of  unanimity  where  it  might  do  harm,  without 
deviating  so  much  as  the  mean  error  from  the  observation  in  the  a  posteriori 
determination. 

Neither  Bayes's  rule  nor  this  latter  one  can  be  of  any  great  use;  but  we  can  always 
employ  them,  when  the  found  probabilities  can  be  looked  upon  as  definitive  results.  On 
the  other  hand,  the  formula  of  the  mean  value  may  be  used  in  all  cases,  if  we  interpret 
the  paradox  of  unanimity  correctly.  Where  the  found  probabilities  are  to  be  subjected  to 
adjustment,  the  latter  formula,  as  I  have  said,  must  be  employed;  nor  can  the  other  rules 
be  of  any  help  in  the  cases  where  observed  probabilities  have  to  be  rejected  on  account 
of  the  skewness  of  the  law  of  errors. 


XVII.   MATHEMATICAL  EXPECTATION  AND  ITS  MEAN  ERKOR. 

§  74.  Whether  the  theory  of  probability  is  employed  in  games,  in  insurances,  or 
elsewhere,  in  all  cases  nearly  in  which  we  can  speak  of  a  favourable  event,  the  prediction 
of  the  practical  result  is  won  through  a  computation  of  the  mathematical  expectation. 
The  gain  which  a  favourable  event  entails ,  has  a  value ,  and  the  chance  of  winning 
it  must  as  a  rule  be  bought  by  a  stake.  The  question  is :  How  are  we  to  compare 
the  value  of  the  latter  with  that  of  which  the  game  gives  us  expectation?  Imagine  the 
game  to  be  repeated,  and  the  number  of  repetitions  A7  to  become  indefinitely  large,  then 
it  is  clear,  according  to  the  definition  of  probability,  that  the  sum  of  the  prizes  won,  if 
each  of  them  is  F,  must  be  pNV,  when  p  indicates  the  probability.  The  gain  to  be 
expected  from  every  single  game  is  consequently  pV,  and  this  product  of  the  probability 
and  the  value  of  the  prize  is  what  we  call  mathematical  expectation. 

The  adjective  "mathematical"  warns  us  not  to  consider  pV  as  the  real  value  which 
the  possible  gain  has  for  a  single  player.  This  value,  certainly,  depends,  not  only  objectively 
on  the  quantity  of  good  things  which  form  the  prize,  but  also  on  purely  subjective  circum- 
stances, among  others  on  how  much  the  player  previously  possesses  and  requires  of  the  same 
sort  of  good  things.  An  attempt  which  has  been  made  to  determine  by  means  of  what  is 
called  the  "moral  expectation",  whether  a  game  is  advantageous  or  not,  must  certainly  be 
regarded  as  a  failure.  For  it  takes  into  account  the  probable  change  in  the  logarithm  of 
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the  player's  property,  but  it  does  not  take  into  consideration  his  requirements  and  other 
subordinate  circumstances.  We  shall  not  here  try  to  solve  this  difficulty. 

It  is  evident,  with  respect  to  the  mathematical  expectation,  that  if  we  play  several 
unbound  games  at  the  same  time,  the  total  mathematical  expectation  is  equal  to  the  sum 
of  that  of  the  several  games.  The  same  is  the  case,  if  we  play  a  game  in  which  each 
event  entitles  the  player  to  a  special  (positive  or  negative)  prize.  In  this  latter  case  we 
speak  of  the  total  mathematical  expectation  as  made  up  of  partial  ones. 

Example  1.  We  play  with  a  die  in  such  a  way  that  a  throw  of  1  or  2  or  3  wins 
nothing;  a  throw  of  4  or  5  wins  2  s.,  and  one  of  6  wins  8  s.  The  total  mathematical 
expectation  is  then  \  x  0  -}-  £  x  2  -f-  -J  x  8  =  2  s.  A  stake  of  2  s.  will  consequently 
correspond  to  an  even  game.  We  might  also  deduce  the  2  s.  throughout,  so  that  a  throw 
of  1,  or  2,  or  3,  causes  a  loss  of  2s.  and  a  throw  of  6  a  gain  of  6s.;  the  total  mathe- 
matical expectation  then  becomes  =  0. 

Example  2.    In  computations   of  the  various  kinds   of  life-insurances  the  basis  is 

1)  the  table  of  the  number  of  persons   I  (a)  living   at  a  given  age   a.    The   probability   of 

v   •  •  Ifa-^-ac)        c  i  •     n   .          .11  .  I  (a)  —  l(a  -j-  x) 

such  a  person  living  x  years  is  =     v    r  ;  ,  of  his  dying  within  x  years  =  -       u  \         ' 

of  his   dying   at  the   exact  age  of  a-\-x  years  =  ---  /7      /     ^' 


/7  i  / 
other  necessary  probabilities  may  be  found  ;  2)  the  rate  of  interest  ^>,  which  serves  for  the 

valuation  of  future  payments  of  capital,  (\.-\-p)-xV,  or  annuities  certain  (1  —  (!-[-/>)"*)  —  • 
The  value  of  an  endowment  of  capital,   F,  payable  in  x  years,  if  the  person  who 
is  now  a  years  old  is  then  alive,  is  thus  equal  to  the  mathematical  expectation 


__  y  , 

-  D(a) 


which,  as  we  see,  is  most  easily  computed  by  means  of  a  table  of  the  function 

D(z)  --  l(z)(\+Pr: 

Such  a  table  is  of  great  use  for  other  purposes  also. 

The  value  of  an  annuity,  «;,  due  at  the  end  of  every  year  through  which  a 
person  now  a  years  old  shall  live,  can  be  computed  as  a  sum  of  such  payments,  or  by 
the  formula 

a:  ==oo  x  =  oo 

W,  (139) 


where  I  (  GO  )  =  0  and  D  (  <x>  )  =  0. 

But  it  deserves  to  be  mentioned  that  this  same  mathematical  expectation  is  most 
safely  looked  upon  as  a  total  mathematical  expectation  in  a  game  whose  events  are  the  various 

possible  years  of  death  ;  the  probability  of  death   in   the  first  year  being    *(a)~*(flt+1)  ^  jn 

l  (a) 
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the  second  ]   (         --  ,  and  so  on  ;  while  the  corresponding  values  are  annuities  certain 

of  />  for  varying  duration.     In  this  way   we  find  for  the  value  of  the  life-annuity   the 
expression 


).  (140) 

Sm.  i>  the  sum   2(l(a-\-x)  —  l(a-\-x-{-l))  •=»  /(a),  we  find  by  solution  of  the  last  paren- 

o 
thesis  that  the  expression  may  be  written 

X  =  00 

7-7  4  J^'(«+*>  -<<«+*+!»  t^)-*, 

and  this  shows  that  the  value  of  the  life-annuity  is  the  difference  between  the  capital  sum 
of  which  the  yearly  interest  is  v  and  the  value  of  a  life-insurance  of  --  payable  at  the 
beginning  of  the  year  of  death. 

In  life-insurance  computations  integrals  are  often  employed  with  great  advantage, 
instead  of  the  sums  we  have  used  here;  periodical  payments  (yearly,  half-yearly,  or  quarterly) 
being  reduced  to  continuous  payments,  and  vice  versa. 

§  75.  That  mathematical  expectation  is  not  a  solid  value,  but  an  uncertain  claim, 
is  expressed  in  the  law  of  errors  for  the  mathematical  expectation,  and  particularly  in  its 
mean  error  ;  for,  owing  to  the  frequent  repetitions  and  combinations  in  games  and  insurances, 
it  does  not  matter  much  that  the  isolated  laws  of  errors,  here  as  for  the  probabilities,  are 
often  skew.  If  the  value  V  is  given  free  of  error,  the  square  of  the  mean  error  of  the 
mathematical  expectation,  H  =pF,  is,  according  to  the  general  rule,  to  be  computed  by 

L2(H)  -p(l-p)V*.  (141) 

If  there  are  N  repetitions  of  the  same  game  we  get 

H'  =  pNV 
and 

*,(#')  =  p(l-p)NV;  (142) 

and  for  the  total  expectation  of  mutually  free  games,  H"  =  ZpiNiVi,  we  have 

lt(H")  =  -  2pi(\—pi)NiV?.  (143) 

By  free  games  we  may  pretty  safely  understand  such  as  are  not  settled  by  the  various 
events  of  the  same  trial  or  game.  (As  to  these,  see  §  76.) 

The  mean  error  is  excellently  adapted  for  computing  whether  we  ought  to  enter 
upon  a  proposed  game,  or  how  highly  we  are  to  value  uncertain  claims  or  outstanding 
balance  of  accounts.  Such  things  of  course  are  regulated  by  the  boldness  or  caution  of 
the  person  concerned;  but  even  the  most  cautious  man  may  under  fairly  typical  circum- 
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stances  be  contented  with  diminishing  the  value  of  his  mathematical  expectation  by  4 
times  the  amount  of  the  mean  error,  and  it  would  be  sheer  foolhardiness,  if  a  passionate 
player  would  venture  a  stake  which  exceeded  the  mathematical  expectation  by  the  quadruple 
of  its  mean  error.  On  the  other  hand,  a  simple  subtraction  or  addition  of  the  mean  error 
cannot  be  counted  a  very  strong  proof  of  caution  or  boldness  respectively. 

Example  1.  A  game  is  arranged  in  such  a  way  that  the  probability  of  winning 
from  the  person  who  keeps  the  bank  is  ^,  the  prize  is  8  $.  In  n  games  the  mathematical 
expectation  with  mean  error  is  then  (0-8  n  4;  2-4  l/n)  £•  If  the  banker  has  no  property, 
but  may  expect  144  games  to  be  played  before  the  prizes  are  to  be  paid,  he  cannot  without 
imprudence  estimate  his  negative  mathematical  hope,  his  fear,  lower  than  O8x  144 -(-2'4  x  12 
=  144^.  He  must  consequently  fix  the  stake  for  each  game  at  about  one  dollar,  and  will 
thus  stand  a  chance  of  seeing  the  bank  broken  about  once  in  six  times.  If,  however,  he 
has  got  so  much  capital  or  credit,  as  also  so  many  customers,  that  he  can  play  about 
2304  games,  his  business  will  become  very  safe;  the  average  gain  of  20  cts.  per  game  is 
460  8  80  cts.,  or  exactly  4  times  as  great  as  the  mean  error  2  8  40  cts.  x  48.  But  who 
will  enter  upon  such  a  game  against  the  banker  (a  game,  after  all,  which  is  not  worse 
than  so  many  others)?  The  very  stake  is  already  greater  than  the  mathematical  expectation; 
every  prudent  regard  to  part  of  the  mean  error  will  only  augment  the  disproportion. 
No  prudent  man  will  enter  upon  such  a  game,  unless  he  can  thereby  avoid  a  greater  risk: 
in  this  way  we  insure^  our  risks,  because  it  is  too  dangerous  to  be  "one?s  own  insurer". 
If  the  game  is  arranged  in  such  an  entertaining  way  that  we  pay  40  cts.  for  the  excitement 
only  of  taking  part  in  every  game,  then  even  rather  a  cautious  person  may  also  con- 
tinue for  144  games,  the  mean  error  (-[- 2-4 1/144  as  above)  being  only  28  j/t  80  cts.  or 
144(0-8  —  (1-0— 0-4))  $.  For  a  poor  fellow,  who  has  only  one  dollar  in  his  pocket,  but 
who  must  for  some  reason  necessarily  get  8  $,  such  a  game  may  also  be  the  best  resource. 
But  if  a  man  owns  only  2304  8,  and  fails  if  he  cannot  get  8  times  as  much,  then  he 
would  be  exceedingly  foolhardy  if  he  played  2304  times  or  more  in  that  bank.  If  we  must 
run  the  risk,  we  can  do  no  better  than  venturing  everything  on  one  card;  if  we  distribute 
our  chances  over  n  repetitions,  then  we  must,  beyond  the  mathematical  expectation,  hope 
for  ]/n  times  that  part  of  the  mean  error  which  might  help  by  the  one  attempt. 

Example  2.  Two  fire-insurance  companies  have  each  insured  10,000  farms  for  a 
total  insurance  of  f  10,000,000.  The  yearly  probability  of  damage  by  fire  is  j^,  and 
both  must  every  year  spend  £  5000  on  management.  Both  have  sufficient  guaranty-fund 
to  rest  satisfied  with  one  single  mean  error  as  security  against  a  deficit  in  each  fiscal  year. 
How  high  must  either  fix  its  annual  premium,  when  there  is  the  difference  that  the  com- 
pany A  has  10,000  risks  of  £1,000,  while  B  has  insured: 
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n 

100  farms  for  f  10,000 

400       « 
1,500       » 
2,500 
2,000 
1,500 
2,000 


a 

III! 

n  ii- 

0,000 

£  1,000,000 

10,000  x(10)6 

5,000 

..  2,000,000 

10,000 

2,000 

..  3,000,000 

6,000  ,  » 

1,000 

..  2,500,000 

2,500 

500 

•  1,000,000 

500 

200 

..   300,000 

60 

100 

..   200,000 

20 

£  10,000,000 

29,080  x(10)6 

10,000  farms 

Since  p(\  — p)  =  0-000999,  the  mathematical  expectation  -t  its  mean  error  is  in  the  case 
of  A  =  £  100,000  ±  £3,161,  in  the  case  of  B  =  £  10,000  ±  £5,390;  the  premiums 
are  therefore  £1  16s.  4  d.  and  £2  7s.  10  d.  respectively  for  £1,000;  i.e.  B  must  reinsure 
part  of  its  risks. 

§  76.  The  mean  error  and,  in  general,  the  law  of  error,  of  the  total  mathematical 
expectation  for  mutually  bound  events  which  may  be  considered  co-ordinate  events  of  the 
same  trials,  are  computed  in  half-invariants  by  means  of  the  sums  of  powers.  If  the  trial 
can  have  n  various  events,  of  which  the  one  whose  probability  is  pi  entails  a  gain  of  the 
value  a,,  and  we  imagine  the  same  repeated  a  sufficiently  large  number  of  times  (N  times), 

the  account  will  show: 

«x  occurring  plN  times, 


Hence 


an  occurring  pnN  times. 


s    = 


and  the  half-invariants  for  the  single  trial  will  be 

^i  =  =  P\a\  +  •  •  •  -\-pndn  =  the  total  mathematical  expectation  ==  H(\,  .  .  .  n)  ; 
lt(H(l,  .    .  n))  ==  p^^-i-  .  .  .  +pn«n2  -  (p,a,  +  .  .  .  +^«an)2.  (144) 

By  this  formula,  therefore,  we  must  in  such  cases  compute  the  square  of  the  mean 
error  of  the  total  mathematical  expectation  for  the  single  trial.  For  the  square  of  the 
mean  error  of  the  expectation  from  N  trials  we  have  consequently 

lt(N,H(l,  ...n))  =-  N(plal*+...-\-p&u*-H(l,...nF).  (145) 

By  even  game  we  understand  a  game  where  the  total  mathematical  expectation 
is  0;  the  last  term  of  this  formula  will  consequently  disappear  in  such  a  game.  As  the 
mean  error  does  not  depgnd  on  the  absolute  values  of  the  gains  or  losses,  but  only  on 
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their  differences,  we  may  in  the  computation  of  the  squares  of  the  mean  errors  reduce  to 
even  game  by  subtracting  the  mathematical  expectation  from  all  the  gains,  and  adding  it 
to  the  losses.  Thus  we  may  write: 

lt(N,H(l,  ...n))  =  N(pl(al—B(l,...n))*+...+pm(am-H(l,...nW).      (146) 
This  rule  then  differs  from  the  rule  of  unbound  games  only  in  the  absence  of  the  factors 

(1-pJ,     ...     (1-pj. 

We  can  now  compute  the  mean  errors  in  the  examples  1  and  2,  in  §  74.  la 
No.  1  we  have 

**(H)  =  -MO)2  +  i(2)2  +  !(8)2-22  = 

=  l(-2)2  + i(0)2  +  A(6)2       =  8. 

In  the  life-annuity  example  we  now  see  the  advantage  of  using  the  longer  formula 
(140)  for  the  value  of  the  annuity,  rather  than  the  formula  (139)  which  gives  the  value  as 
the  sum  of  a  number  of  endowments;  for  the  partial  expectations  are  here  not  unbound, 
and  only  the  deaths  in  the  several  years  of  age  exclude  one  another  and  can  be  considered 
co-ordinate  events  in  the  same  game.  For  the  square  of  the  mean  error  of  tha  life-annuity 
we  have,  from  (144): 


>    Yla      x  -la      x     1      1-  1+     -^ 
*  W°  +/0>      >' 

§  77.  In  the  above  studies  on  the  mean  errors  of  mathematical  expectations  we 
have  supposed  that  the  probabilities  we  use  are  free  from  error,  being  either  determined 
a  priori  by  good  theory  or  found  a  posteriori  from  very  large  numbers  of  repetitions.  This 
determination  is  not  complete  in  the  cases  in  which  the  probabilities  determined  a  posteriori 
are  found  only  by  small  numbers  of  trials,  or  if  probabilities  computed  a  priori  presuppose 
values  observed  with  sensibly  large  mean  errors.  The  same  warning  must  be  taken  with 
respect  to  other  values  which  may  enter  into  the  computed  mathematical  expectations;  the 
value  of  the  gains,  for  instance,  may  depend  on  the  future  rate  of  interest.  Whether  some 
of  the  manifold  sources  of  errors  are  to  be  omitted  in  a  computation  of  the  mean  error, 
or  not,  must  for  each  special  case  depend  on  the  relative  smallness  of  their  parts  of  the 
total  /12.  As  to.the  theory  of  probability  it  is  characteristic  only  that  the  parts  of  the  squares 
of  the  mean  errors,  considered  in  §§  75  and  76,  are,  as  a  rule,  very  important,  while  the 
analogous  parts  in  other  problems  are  often  insignificant.  When  the  orbit  of  a  planet  is 
computed  by  the  method  of  the  least  squares,  then,  in  order  to  restrict  the  limits  of 
research  for  its  next  discovery,  we  have  to  compute  the  mean  errors  of  its  co-ordinates 


11.; 

ill  Hie  next  opposition.  Ordinarily  these  mean  errors  are  so  large  that  the  /12  for  its  future 
observations  may  be  wholly  omitted,  though  this  >12  is  analogous  to  those  from  §§  75  and 
Tt'i.  15 ui  when  we  have  computed  a  table  of  mortality  by  the  method  of  the  least  squares, 
we  can  certainly  find  by  that  method  the  mean  error  1//12  (p)  of  the  probability  of 
life  computed  from  the  table;  but  if  we  are  to  predict  anything  as  to  the  uncertainty 
\\ilh  regard  to  n  lives,  and  with  regard  to  the  corresponding  mathematical  expectation  npa, 
then  we  must  not,  unless  n  is  very  great,  take  the  mean  error  as  na VAZ  (p) i  bat  we 
must,  as  a  rule,  first  take  /12  (H)  in  consideration,  and  consequently  use  the  formula 
p).  (Comp.  example,  §72). 
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